CLASSICS IN MATHEMATICS 


Lars H6rmander 


The Analysis 

of Linear 

Partial Differential 
Operators III 


Pseudo-Differential 
Operators 


go) Springer 


Classics in Mathematics 


Lars Hormander The Analysis of Linear Partial 
Differential Operators III 


Lars HOrmander 


The Analysis of Linear Partial 
Differential Operators III 


Pseudo-Differential Operators 


Reprint of the 1994 Edition 


D) Springer 


Lars H6rmander 


University of Lund 
Department of Mathematics 
Box 118 

SE-22100 Lund 

Sweden 

email: lvh@maths.Ith.se 


Originally published as Vol. 274 in the series: 
Grundlehren der Mathematischen Wissenschaften 


Library of Congress Control Number: 2007900647 


Mathematics Subject Classification (2000): 
35A, G, H, J, L, M, P, S, 47G, 58G 


ISSN 1431-0821 
ISBN 978-3-540-49937-4 Springer Berlin Heidelberg New York 


This work is subject to copyright. All rights are reserved, whether the whole or part of the material is 
concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broad- 
casting, reproduction on microfilm or in any other way, and storage in data banks. Duplication of 
this publication or parts thereof is permitted only under the provisions of the German Copyright 
Law of September 9, 1965, in its current version, and permission for use must always be obtained 
from Springer. Violations are liable to prosecution under the German Copyright Law. 


Springer is a part of Springer Science+Business Media 
springer.com 
© Springer-Verlag Berlin Heidelberg 2007 


The use of general descriptive names, registered names, trademarks, etc. in this publication does 
not imply, even in the absence of a specific statement, that such names are exempt from the relevant 
protective laws and regulations and therefore free for general use. 


Production: LE-TpX Jelonek, Schmidt & Vockler GbR, Leipzig 
Cover-design: WMX Design GmbH, Heidelberg 


SPIN 11912330 41/3100YL -5 43210 Printed on acid-free paper 


Grundlehren der 
mathematischen Wissenschaften 274 


A Series of Comprehensive Studies in Mathematics 


Editors 


M. Artin S.S. Chern J.M. Frohlich E. Heinz 

H. Hironaka_ F. Hirzebruch L. H6rmander 

S. Mac Lane W. Magnus C.C. Moore J.K. Moser 
M. Nagata W. Schmidt D.S. Scott Ya.G. Sinai 
J. Tits B.L. van der Waerden M. Waldschmidt 

S. Watanabe 


Managing Editors 
M. Berger B.Eckmann  S.R.S. Varadhan 


Lars HOrmander 


The Analysis of Linear 
Partial 
Differential Operators II 


Pseudo-Differential Operators 


Springer-Verlag 
Berlin Heidelberg NewYork Tokyo 


Lars Hérmander 


Department of Mathematics 
University of Lund 

Box 118 

S-221 00 Lund, Sweden 


Corrected Second Printing 1994 
With 7 Figures 


AMS Subject Classification (1980): 35A, G, H, J, L, M, P, 8S; 47G; 58G 


ISBN 3-540-13828-5 
Springer-Verlag Berlin Heidelberg New York Tokyo 
ISBN 0-387-13828-5 
Springer-Verlag New York Heidelberg Berlin Tokyo 


Library of Congress Cataloging in Publication Data 

(Revised for vol. 3) 

Hérmander, Lars. 

The analysis of linear partial differential operators. 

(Grundlehren der mathematischen Wissenschaften; 256-_) 

Expanded version of the author’s 1 vol. work: Linear partial differential operators. 
Includes bibliographies and indexes. 

Contents: 1. Distribution theory and Fourier analysis - 2. Differential operators with constant 
coefficients - 3. Pseudo-differential operators. 

1. Differential equations, Partial. 2. Partial differential operators. J. Title. II. Series. 
QA377, H578 1983 515.7242 83-616 

ISBN 0-387-12104-8 (U.S.: v. 1) 


This work is subject to copyright. All rights are reserved, whether the whole or part of the 
material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, 
reproduction by photocopying machine or similar means, and storage in data 

banks. Under §54 of the German Copyright Law where copies are made for other than private 
use a fee is payable to “Verwertungsgeselischaft Wort”, Munich. 


© Springer-Verlag Berlin Heidelberg 1985 
Printed in Germany 


SPIN 10427377 41/3140 - 5 43 2 1 — Printed on acid-free paper 


Preface 


to Volumes III and IV 


The first two volumes of this monograph can be regarded as an expansion 
and updating of my book “Linear partial differential operators” published 
in the Grundlehren series in 1963. However, volumes III and IV are almost 
entirely new. In fact they are mainly devoted to the theory of linear 
differential operators as it has developed after 1963. Thus the main topics 
are pseudo-differential and Fourier integral operators with the underlying 
symplectic geometry. The contents will be discussed in greater detail in the 
introduction. 

I wish to express here my gratitude to many friends and colleagues who 
have contributed to this work in various ways. First I wish to mention 
Richard Melrose. For a while we planned to write these volumes together, 
and we spent a week in December 1980 discussing what they should 
contain. Although the plan to write the books jointly was abandoned and 
the contents have been modified and somewhat contracted, much remains of 
our discussions then. Shmuel Agmon visited Lund in the fall of 1981 and 
generously explained to me all the details of his work on long range 
scattering outlined in the Goulaouic-Schwartz seminars 1978/79. His ideas 
are crucial in Chapter XXX. When the amount of work involved in writing 
this book was getting overwhelming Anders Melin lifted my spirits by 
offering to go through the entire manuscript. His detailed and constructive 
criticism has been invaluable to me; I as well as the readers of the book 
owe him a great debt. Bogdan Ziemian’s careful proofreading has eliminated 
numerous typographical flaws. Many others have also helped me in my 
work, and I thank them ail. 

Some material intended for this monograph has already been included in 
various papers of mine. Usually it has been necessary to rewrite these 
papers completely for the book, but selected passages have been kept from a 
few of them. I wish to thank the following publishers holding the copyright 
for granting permission to do so, namely: 

Marcel Dekker, Inc. for parts of [41] included in Section 17.2; 

Princeton University Press for parts of [38] included in Chapter XXVII; 
D. Reidel Publishing Company for parts of [40] included in Section 26.4; 
John Wiley & Sons Inc. for parts of [39] included in Chapter XVIII. 
(Here [N] refers to H6rmander [N] in the bibliography.) 

Finally I wish to thank the Springer-Verlag for all the support I have 
received during my work on this monograph. 


Djursholm in November, 1984 Lars Hérmander 


Contents 


Introduction. a. 2.04 eee hk Ea we ee SE es 1 
Chapter XVII. Second Order Elliptic Operators ........2.. 3 
SUMMALY <4 on oh 4 4 Bek oe Re ck de oe wee ee a 3 
17.1. Interior Regularity and Local Existence Theorems .... . 4 
17.2, Unique Continuation Theorems ..........2... 9 
17.3. The Dirichlet Problem ...............0.04 24 
17.4. The Hadamard Parametrix Construction ......... 30 
17.5. Asymptotic Properties of Eigenvalues and Eigenfunctions .. 42 
NOTES, 2 ns uk hee SR, WLR ee ee Sg ee Be RO 61 
Chapter XVIII. Pseudo-Differential Operators ........... 63 
Summary? <6 2tdce 8 Ae a Mb we EA aS fe A SS 63 
18.1. The Basic Calculus... 2... 65 
18.2. Conormal Distributions .. 2... 2... .....20040. 96 
18.3, Totally Characteristic Operators ...........0.04 112 
18.4. Gauss Transforms Revisited ............2.4, 141 
18.5. The Weyl Calculus... ... 2... 2... ee ee 150 
18.6. Estimates of Pseudo-Differential Operators ........ 161 
INOTES®. hires a es te ns Be ak Mae tee aero at «ive Geter ob, 8 178 
Chapter XIX. Elliptic Operators on a Compact Manifold Without 
Boundary’: & 008s ewe ON OA ee eT aoe OE oe es 180 
Summary” 20-9 6 ae Se we ee eel a eat ee 180 
19.1. Abstract Fredholm Theory ............... 180 
19.2, The Index of Elliptic Operators ...........4. 193 
19.3. The Index Theoremin RR” .............00.8. 215 
19.4. The Lefschetz Formula .. 2... 2.2... 2. 1 eee 222 
19.5. Miscellaneous Remarks on Ellipticity .........4.. 225 
Notes og eta So ew ho 208 dk eR gh ek Blarch QL ew 229 


Chapter XX. Boundary Problems for Elliptic Differential Operators . 231 


SUMMALY oe wee ee EE ee a Re ees PO ea 231 
20.1. Elliptic Boundary Problems .............4.. 232 


Contents VI 


20.2. Preliminaries on Ordinary Differential Operators . .... . 251 
20.3. The Index for Elliptic Boundary Problems ......... 255 
20.4. Non-Elliptic Boundary Problems .........2.2... 264 
NOLES = aioe ge: Syst ca oP ah leh sap eas, “cee Nal de a a to a cle ee ee ee 266 
Chapter XXI. Symplectic Geometry .............4.. 268 
Summary. oc: %. 4a 6 ge Bee ee GD ek Sr Ae ee age 2 268 
21.1. The Basic Structure... ..........- 2205 269 
21.2. Submanifolds of a Sympletic Manifold ........2.. 283 
21.3. Normal Forms of Functions ............... 296 
21.4. Folds and Glancing Hypersurfaces ........2.2.2.. 303 
21.5: Symplectic Equivalence of Quadratic Forms ........ 321 
21.6. The Lagrangian Grassmannian .........2.2... 328 
NOLES: oan PP SA Se See a Ge eh Ge et eas 346 
Chapter XXII. Some Classes of (Micro-)hypoelliptic Operators . . . . 348 
Summary . 2... ee ee ee Plex bitten wat ne nian 348 
22.1. Operators with Pseudo-Differential Parametrix ....... 349 
22.2. Generalized Kolmogorov Equations ........... 353 
22.3. Melin’s Inequality 2... 2... 2... eee ee 359 
22.4. Hypoellipticity with Loss of One Derivative ........ 366 
INOTES! 6 tes Se Se ee WE he, wh Ree eat Pe 383 
Chapter XXIII. The Strictly hyperbolic Cauchy Problem ..... . 385 
SUMMATY”. xi Me ea sat aes A a ee a Rh ae S, eet 385 
23.1. First Order Operators, . 20... 2.0.7; Sos Ges tb RASS aa sere 385 
23.2, Operators of Higher Order . 2... ... 2... ue. 390 
23.3. Necessary Conditions for Correctness of the Cauchy 
Problem... ok eh a SA ee Ro aD Bae he Bl ae hs 400 
23.4. Hyperbolic Operators of Principal Type... ....... 404 
NOLES? fea MO she uae eh tet, te i Bi a the Te te te eh Gol 414 
Chapter XXIV. The Mixed Dirichlet-Cauchy Problem for Second Order 
OPEFAtOrs:s ce sf - aimnas es ae te che cs A a RL Rea, a Aas wee dar A 416 
Summary? ce. 4026 As oe Oe cg ak ae alae Bd YAN A eee, eS 416 
24.1. Energy Estimates and Existence Theorems in the 
Hyperbolic Case... 2. 1. es 416 
24.2, Singularities in the Elliptic and Hyperbolic Regions ... . 423 
24.3. The Generalized Bicharacteristic Flow ......... . 430 
24.4. The Diffractive Case 2... 2... ee ee 443 
24.5. The General Propagation of Singularities .......2.2. 455 
24.6. Operators Microlocally of Tricomi’s Type ......... 460 
24.7. Operators Depending on Parameters ........2.2.. 465 


VIII Contents 


Appendix B. Some Spaces of Distributions .........2.02., 471 
B.1 Distributions in R" and in an Open Manifold ....... 471 
B.2. Distributions in a Half Space and in a Manifold 

with Boundary ...... 1... ee ee ee ee ee 478 

Appendix C. Some Tools from Differential Geometry .. ..... . 485 
C.1. The Frobenius Theorem and Foliations .......2... 485 
C.2. A Singular Differential Equation . . 2... ......., 487 
C.3. Clean Intersections and Maps of Constant Rank ...... 490 
C4. Folds and Involutions ...............04 492 
C.5. Geodesic Normal Coordinates .............. 500 
C.6. The Morse Lemma with Parameters ........... 502 
INOteSé in. a ks Bae ee, BS ee Se ce Ne a pb te ee Ba eS 504 

Bibliography . 2... 1. 505 

Index. . 2 Bee BO a ee es ae Re es He ek, Oh he os 523 

Index of Notation ©... 2... 525 


Introduction 


to Volumes III and IV 


A great variety of techniques have been developed during the long history 
of the theory of linear differential equations with variable coefficients. In 
this book we shall concentrate on those which have dominated during the 
latest phase. As a reminder that other earlier techniques are sometimes 
available and that they may occasionally be preferable, we have devoted the 
introductory Chapter XVII mainly to such methods in the theory of second 
order differential equations. Apart from that Volumes III and IV are in- 
tended to develop systematically, with typical applications, the three basic 
tools in the recent theory. These are the theory of pseudo-differential oper- 
ators (Chapter XVIII), Fourier integral operators and Lagrangian distri- 
butions (Chapter XXV), and the underlying symplectic geometry (Chapter 
XXI). In the choice of applications we have been motivated mainly by the 
historical development. In addition we have devoted considerable space and 
effort to questions where these tools have proved their worth by giving 
fairly complete answers. 

Pseudo-differential operators developed from the theory of singular in- 
tegral operators. In spite of a long tradition these played a very modest role 
in the theory of differential equations until the appearance of Calderén’s 
uniqueness theorem at the end of the 1950’s and the Atiyah-Singer-Bott 
index theorems in the early 1960’s. Thus we have devoted Chapter XXVIII 
and Chapters XIX, XX to these topics. The early work of Petrowsky on 
hyperbolic operators might be considered as a precursor of pseudo-differen- 
tial operator theory. In Chapter XXIII we discuss the Cauchy problem 
using the improvements of the even older energy integral method given by 
the calculus of pseudo-differential operators. 

The connections between geometrical and wave optics, classical me- 
chanics and quantum mechanics, have a long tradition consisting in part of 
heuristic arguments. These ideas were developed more systematically by a 
number of people in the 1960’s and early 1970’s. Chapter XXV is devoted to 
the theory of Fourier integral operators which emerged from this. One of its 
first applications was to the study of asymptotic properties of eigenvalues 
(eigenfunctions) of higher order elliptic operators. It is therefore discussed in 
Chapter XXIX here together with a number of later developments which 
give beautiful proofs of the power of the tool. The study by Lax of the 
propagation of singularities of solutions to the Cauchy problem was one of 
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the forerunners of the theory. We prove such results using only pseudo- 
differential operators in Chapter XXIII. In Chapter XXVI the propagation 
of singularities is discussed at great length for operators of principal type. It 
is the only known approach to general existence theorems for such oper- 
ators. The completeness of the results obtained has been the reason for the 
inclusion of this chapter and the following one on subelliptic operators. In 
addition to Fourier integral operators one needs a fair amount of symplectic 
geometry then. This topic, discussed in Chapter XXI, has deep roots in 
classical mechanics but is now equally indispensible in the theory of linear 
differential operators. Additional symplectic geometry is provided in the 
discussion of the mixed problem in Chapter XXIV, which is otherwise 
based only on pseudo-differential operator theory. The same is true of 
Chapter XXX which is devoted to long range scattering theory. There too 
the geometry is a perfect guide to the analytical constructs required. 

The most conspicuous omission in these books is perhaps the study of 
analytic singularities and existence theory for hyperfunction solutions. This 
would have required another volume - and another author. Very little is 
also included concerning operators with double characteristics apart from a 
discussion of hypoellipticity in Chapter XXII. The reason for this is in part 
shortage of space, in part the fact that few questions concerning such operators 
have so far obtained complete answers although the total volume of results 
is large. Finally, we have mainly discussed single operators acting on scalar 
functions or occasionally determined systems. The extensive work done on 
for example first order systems of vector fields has not been covered at all. 
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Summary 


The study of differential operators with variable coefficients has led to the 
development of quite elaborate techniques which will be exposed in the 
following chapters. However, much simpler classical methods will often 
work in the second order case, and some results are in fact only valid then. 
Moreover, second order operators (or rather related first order systems) play 
an important role in many geometrical contexts, so it seems natural to 
exploit the simplifications which are possible for them. However, the well 
motivated reader aiming for the most high powered machinery can very 
well skip this chapter altogether. 

Elliptic operators are of constant strength so the results proved in 
Chapter XIII are applicable to them. The perturbation arguments used in 
Chapter XIII are recalled in Section 17.1 in the context of elliptic operators 
with low regularity assumptions on the coefficients and with I? or Hélder 
conditions on the solutions. However, we shall not aim for such refinements 
later on since their main interest comes from the theory of non-linear 
differential equations which is beyond the scope of this book. 

Section i7.2 is mainly devoted to the Aronszajn-Cordes uniqueness 
theorem stating in particular that if 


¥. a,(x) Dtu=0 

lal $2 
is an elliptic equation where a, are real valued Lipschitz continuous func- 
tions for |«|=2 and a, are bounded for |a|<2, then u vanishes identically if 
u vanishes of infinite order at some point. No such result is true for 
operators of higher order than two although there are weaker uniqueness 
theorems concerning solutions vanishing in an open set (see also Chapter 
XXVIII. In this context we also return to the uniqueness theorems of 
Section 14.7 where we now allow first order perturbations. 

In Section 17.3 we study the simplest classical boundary problem, the 
Dirichlet problem, consisting in finding a solution of du=f with given 
boundary values. When the coefficients are constant and the boundary is flat, 
a reduction to the results of Section 17.1 is obtained by a simple reflection 
argument. As in Section17.1 we can then use perturbation methods to 


4 XVII. Second Order Elliptic Operators 


handle variable coefficients and a curved boundary. Thus the boundary is 
flattened, coefficients are frozen at a boundary point, the norm of the error 
then committed is estimated, and a Neuman series is applied. Obviously no 
good information on the singularities of solutions can be obtained in that 
way. In Section 17.4 we therefore present the Hadamard parametrix method 
which exploits the simple form of a second order operator in geodesic 
coordinates to describe the singularities of the fundamental solution with 
arbitrarily high precision. This method is in fact applicable to all second 
order operators with real non-degenerate principal symbol. It can also be 
applied to the Dirichlet problem although with considerable limitations due 
to the possible occurence of tangential or multiply reflected geodesics. 

In Section 17.5 we combine the results of Sections 17.3 and 17.4 to a 
study of the asymptotic properties of eigenfunctions and eigenvalues of the 
Dirichlet problem. First we prove the precise error estimate of Avakumovité 
away from the boundary. A fairly precise analogue at the boundary is given, 
but one component of the proof cannot be completed until Chapter XXIV. 
Further refinements will be given in Chapter XXIX. 


17.1. Interior Regularity and Local Existence Theorems 


Despite the title of the chapter we shall here study a differential operator 
P(x,D)= ) a,(x) D* 
Jal Sm 
of arbitrary order m in an open set X<IR". We assume that for some 
pe(l, 0) 
(i) a, is continuous when |a|=m; 
(ii) P,(0,D)= YY a,(0) D* is elliptic; 
aj=m 
(iii) a,eLV"—') if m—|a|<n/p, a,eL2x! for some ¢>0 if 


loc 


m—|a|=n/p, a,eLh,, if m—|a|>n/p. 


loc 


We can then supplement Theorem 13.2.1 as follows: 
Theorem 17.1.1. If (i)-(iii) are fulfilled and X is a sufficiently small neigh- 
borhood of 0, then there is a linear operator E in L?(X) such that 


(17.1.1) DP(X)a fie D* Efe l(X) is continuous if pSqSo and 
1/q21/p—(m—|a|)/n with strict inequality if q= 0; 


(17.1.2) P(x,D)Ef=f, feL?(X), 
(17.1.3) EP(x,D)u=u if ue C%(X). 


Proof. Let p(D)=£,(0,D) and choose Foye’ according to Theorem 7.1.22 so 
that Fo(€)=1/p() when [€|21 and Fye C®. Then it follows from Theorems 
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7.9.5 and 4.5.9 that 
(17.1.4) || D° Fo * line S Cllg lize 
if gel?O@’, 1/q=1/p—(m—|a))/n, q<oo. 


Moreover, D°F,éL,,, if m—|a|>n(1—1/r), for D*F, is essentially homo- 


geneous of degree m—|a|—n> —n/r. 

Let E, be a fundamental solution of p(D). Then F,—E,eC®. If geLl?(X) 
we define go=g in X and go=0 in §X, and set Eog=Eo*go\x. From 
(17.1.4) and the subsequent observations it follows if X is contained in the 
unit ball that 


(17.1.5) [D* Eo SllraxS Ciisllzoxy, EP (X). 


Here 1/q=1/p—(m—|a|)/n when m—|a|<n/p, we choose q=p(p+e)/é with ¢ 
as in condition (iii) when m—|a«|=n/p, and g=oo when m—|a|>n/p (take 
1/r+1/p=1). Now 


P(x, D)Eyg = p(D)E og +(P(x, D)— p(D))E,g=g+Reg, 
Rg= Y (a,(x)—a,(0)) D?Egg + d 4 (x) D* Egg. 


la| =m lal <m 
By Holder’s inequality, (17.1.5) and conditions (i) and (iii), we have 
Rall rea S43 8llzean> geL(X), 


if X is sufficiently small. Thus [+R is then invertible, and E=E,(I+R)~' 
has properties (17.1.1) and (17.1.2) by (17.1.5) and the fact that 


P(x, D) Ef =(I+ R)\U+R)'f =f. 


Finally, if f= P(x, D)u, ue Ce(X), then the unique solution of the equation 
g+Re=f is g=p(D)u, for Egg=u, hence 


p(D)u+ Rp(D)u=p(D)u+ YY (a,(x)—a,(0))D*u+ Y a,(x)D*u 


ja] =m la| <m 
is equal to P(x, D)u in X. This completes the proof. 


If one replaces the L? conditions by Hélder conditions one obtains the 
following theorem instead: 


Theorem 17.1.1’. Assume that for some ye(0,1) the coefficients of P(x, D) are 
in C” in a neighborhood of 0, and that F,,(0, D) is elliptic. If X is a sufficiently 
small ball with center at 0 then there exists a linear operator E in C’(X) such 
that 


(17.1.1) C’(X)a fre D*EfeC(X) is continuous if \a|<m; 
(17.1.2) P(x,D)Ef=f, feC(X); 
(17.1.3Y EP(x,D)u=u_ if ueCP(X). 
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Here C’(X) is the set of all continuous functions in X such that the 
norm 


sup |g(x)|+ sup |g(x)—g(y)IAx—yl 
xeX x,yeX 


is finite. If X has radius r, then a C” extension to the whole space is given 
by 


So(x)=g(>), xeX; 
B(x) = (rx/|x|)(2—|x\/r),  rS|x|S2r; 
Zo(x)=0, |x| >2r. 


The proof of Theorem 17.1.1’ is identical to that of Theorem17.1.1 except 
that g, is defined in this way and that (17.1.4) is replaced by the continuity 
in C’ when |a|=m, which follows from Theorem 7.9.6. We leave the details 
for the reader since the result will never be used here. 


By a slight twist of the proof of Theorem17.1.1 one can prove a loga- 
rithmic convexity theorem for the I? norms of the derivatives which will be 
useful later on. To shorten the proofs we exclude lower order terms now. 
First we prove a lemma. 


Lemma 17.1.2. If P(D) is homogeneous and elliptic of order m, then 
(17.1.6) > At! | D* oll p< C(\| PD) oll p> + A” loll ze) 


jal Sm 


if A>O and D*veL?, \a|<m. 


Proof. Introducing Ax as a new variable instead of x makes A disappear in 
(17.1.6) so we may assume in the proof that A=1. We define F, as in the 
proof of Theorem 17.1.1, thus P(D)F,=6-+a@ where wmeX% Then we have 


D* v= D* Fy * P(D)v—(D* a) * v, 


and (17.1.6) follows since D*wel' and D*F, satisfies the hypotheses of 
Theorem 7.9.5. 


Remark. It follows from the proof that C can be taken independent of P if P 
varies in a compact set of elliptic polynomials of degree m. 


Theorem 17.1.3. Assume that P(x, D) satisfies the hypotheses (i) and (ii) above 
in a compact neighborhood K of 0. Let X <K be an open set, and denote by 
d(x) the distance from xeX to [X. If D*ueL?(X), |x| Sm, it follows then that 


(17.1.7) |[d(x)'*! Du roxy S C((|d(x)” Palx, D)ull cea + (ull zecx)!!™ balled, 
where C is independent of X. 
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Proof. Let B=B(y,R) be a ball with radius R and center yeX with 
d(y)22R. Set x,(x)=x(x—y)/R) with a fixed yeCP(B(O,1)) which is equal 
to 1 in B(0,4). Applying (17.1.6) to P(D)=P,{y, D) and v=y,u gives with 
another C 


y Apim— led f {D*ul? dx 


jal Sm "By, 4R) 
<C( J IP (x, D)ulPdx+e(R) > f [DtulPdx 
B(y,R) |a| =m B(y,R) 
+ Y Rope le) i} Deal? d+ AP” f jul? dx). 
lal <m Bly, R) B(y,R) 


Here we have expanded P(D)(y,u) by Leibniz’ formula and estimated 
x2(P,,(y, D)— P(x, D)) u(x) by means of the modulus of continuity ¢ of the 
coefficients. Thus «(R) 0 when R->0. Now we take A= M/R where M is a 
large constant and multiply by R’”. This gives 


y MPtm—la) Relat J |D*ul? dx 


lalsm B(y, $R) 


<C( § [R™P(x,D)ul?dx+e(R) Y f |R™D*ulP dx 


Bly, R) ja] =m B(y,R) 


+ ¥ Rll f [DtujPdx+MP™ f |ul? dx). 


a} <m B(y, R) B(y, R) 


With some small R, to be chosen later we define 
R(y)=min (Ro, d(y)/2) 


and integrate with respect to R(y)""dy over X. Since |R(x)— R(y)|S|x—y|/2 
it follows if |x —y|<R(y) that |R(y) —R(x)|<R(y)/2, hence 


R(y)/2 < R(x) <3 R(y)/2. 
On the other hand, if |x —y|<2R(x)/5 then |R(y)— R(x)|< R(x)/5 so 


4R(x)/5< R(y)<6R(x)/5. 
Hence 
ff dy/ROY"S$G/2"" fs dy/R(x)" =3" fay, 
xEBY,RO)) jx—y]<2R(x) lyl<i 
f  dy/RoY' 26/6" ff dy/R@)"=3-" ff dy. 
x€B(y,3RO)) [x-y|<2R(x)/5 lyl<t 


With a new constant C independent of Ry it follows that 


y Men iad f [R(x)!*! D“ul? dx 
\a|<m 
<C(JIR(x)"P,(x, Dyul?dx+e(Ry) ¥ J|R(x)"D*ul? dx 
|a| =m 


+ ¥ SIR(x)*! D*ul? dx + MP" | |ul? dx). 


Ja|<m 
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Choose Ro so small that Ce(Ro)<4. When M2 Mp, say, we can then cancel 
the two sums on the right-hand side against half of the left-hand side and 
obtain 


M™— ll R(x)!" D*u || pp S CR)" B(x, D)ull pp + M™ [ul 2»). 


We choose M=M, if ||R(x)"P,(x, D)ull,p><Mllull_p; otherwise we take M 
so that 


M™ |ull p> = || R(X)" F%, D)ull ie, 
which gives (17.1.7). 


Corollary 17.1.4. Assume that P,, satisfies the hypotheses (i), (ii) in a neigh- 
borhood K of 0. If D*ueL? in K~ {0} for |a|<m and 


(17.1.8) j julPdx=O(R®), RO, 
R<|x|<2R 
(17.1.9) IP,(x,D)ulS CY. [D%ul|x|"!-™ in KX {0} 
jal<m 


then it follows if |a|<m that 


(17.1.10) J |R*! DtujPdx=O(R*), RO. 


R«<|xj<2R 


Proof. We can apply Theorem 17.1.3 with X = B(0,2R)~ B(O,R) if R is small. 
Then 


d(x)" |P,(x, D)ul SCY d(x)!*!|D* uj 
t 


al<m 


because d(x) <R<|x|. Hence it follows from (17.1.7) that 


S= y ld" D* ulin Cy Se" a event 


lal<m 
Thus 
(R/3)*! De ulPdx< Yd!" Duley 


lal<m 4R<3 |x| <5R lal<m 
SCT? |lullZo~x) =O(R"), 


which proves (17.1.10) for |a|<m. Another application of (17.1.7) gives 
(17.1.10) when |a]=m also. 


With applications to global existence theory in mind we shall discuss in 
Section 17.2 whether a solution u of a differential equation with principal 
symbol P, must be zero when (17.1.8) is valid for all N (or, equiva- 
lently, if (17.1.10) is valid for all « with |«|<m and all N). We shall then 
have to assume that the coefficients of P, are Lipschitz continuous, that is, 
|a,(x)—a,(y)|S C|x—y|, |a|=m. Then we can define P,(x, D)u in the distribu- 
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tion sense if D‘ueL?, |a|<m, and Theorem 17.1.3 as well as Corollary 17.1.4 
can be improved by means of Friedrichs’ emma: 


Lemma 17.1.5. Let vel?(IR") and let |a(x)—a(y|SM\|x—yl if x,yeR". If 
peCe and o,(x)=(x/e)e~", then 


(171.11) [(@D,v)*¢,—a(D,0*$) lo SM lolz» [Gl +lylID, Play. 
For fixed v the left-hand side tends to 0 when ¢-0. 


Proof. Since C?? is dense in I? we may assume that ve Cf, and it suffices to 
prove (17.1.11) since it is then obvious that the limit is 0. The quantity to 
estimate is 


If (a(x — y)—a(x))(D; »)(x—y) 6, (y) dy 
=|f (a(x — y)—a(x)) o(x — y)D, 6,(y)dy—J(Dja)(x—y) oxy) b,(y) dyl 
SM flo(x—y) (yl 1D, 6. +16. dy. 


(17.1.11) follows now from Minkowski’s inequality since 


Sd. ON +1V1 1D; do.) dy 


is independent of «. 


Let us now return to Theorem 17.1.3 assuming only that D*ueI?(X), 
ja|<m, but that a, are Lipschitz continuous and that P,(x, D)ueL?(X). Let 
Xoo X1€CH(X), xy, =1 in a neighborhood of supp yo, and set v=y u. Then 
veé'(X) and D*vel?, ja|<m, P,(x,D)vel?. Choose ¢eCP with |pdx=1 
and set v,=v*@, where ¢,(x)=(x/e)/e”. Then v,eC? and if b,=x,a, we 
have for small ¢ 

P,(x,D)v,= ) b,D% v,—> P,(x,D)v in IP 
la] =m 

by Lemma 17.1.5 since P,(x,D)v=} b, D*v. Hence we can apply (17.1.7) to 
v,—vs and conclude that D*v, has a limit in I? when ¢-0 if |«|=m. Hence 
D*uelh,,{X) when |a|<m. The estimate (17.1.7) is therefore true if X is 
replaced by {xeEX; d(x)>p}. Letting p>0 we obtain (17.1.7) as it stands. 
Thus Theorem 17.1.3 and Corollary 17.1.4 are valid when a, are Lipschitz 
continuous and Duel, ja|<m. 


17.2. Unique Continuation Theorems 
We shall begin with a unique continuation theorem similar to Theorem 8.6.5 
where operators of higher order are allowed. Let 


P,(x,D)= Y, a,(x) D* 


lal =m 
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be defined in an open set X CIR" and assume 
(i) a, is Lipschitz continuous in X, 
(ii) P, is elliptic in X. 
By 2 we denote the closed conic set 


(17.2.1) 3 = {(x,N) € T*(X)\ 0; Pm(x,€+7N) has a zero 7 of multiplicity 22 
with £+7N #0 for some € € IR"}. 


Of course t cannot be real then. 


Theorem 17.2.1. If D*ueLi,,(X), |a|<m, and P,(x, D)ueL?,,(X), 
(17.2.2) | [P.(x,D)ulSC Y [Dtul in X 


lal <m 


then N(suppu)c2, where © is defined by (17.2.1). 


For the notation N and the global uniqueness results which follow from 
Theorem 17.2.1 we refer to Sections 8.5 and 8.6. The definitions of 2 and of 
N are both local and invariant under local diffeomorphisms so it is suf- 
ficient to prove that if 0EX and (0,N)éz, N=(0,...,0,1) then u=0 in a 
neighborhood of 0 if suppun {x; x,20}< {0}. This will be done by means 
of estimates with respect to high powers of a weight function with maxi- 
mum in the support of u taken at 0 only. 

Set p(€)=P,,(0,¢). Then the hypothesis (0,N)¢ means that p(€+itN) 
and p™(E+itN)=dp(E+itN)/OE, have no common zero (€,t)ER"**~ {0}. 
Thus 


(17.2.3) Yo? FD E81? < C(lp(E+it NN? +27 |pM(E+it N)I); 
|a] Sm 
(¢,t)eIR"*?; 


for both sides are homogeneous of degree 2m and can only vanish if + =0 
and p(€)=0, that is €=0. Next we need an identity of Treves which is 
closely related to the commutation relations. 


Lemma 17.2.2. Let Q(x)=)\a,x,;+)°b,x7/2 be a real quadratic polynomial in 
IR" and let P(D) be a differential operator with constant coefficients. If 
ue CP(IR") and v=ue®? then 


(17.2.4) f |P(D)u\e2ax = f |P(D + iQ! /2)v|? dx 
= [> |P@@ — ig! /2)v|"b*/a! de. 
a 
Proof. The first equality is obvious since D,u=e~%?(D,;+i0,Q/2)v. The 
adjoint of D;+id,Q/2 is D;—i@,;Q/2 so we must show that 


(17.2.5) P(D—iQ'/2) P(D+iQ’/2)=¥, P® (D+ iQ’ /2) P(D —iQ'/2) b*/a!. 
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Now the commutators 
[D ,—i6,Q/2, D,+i6,Q/2]=0,0,Q0=b,6, 
are the same as the commutators of @, and b, x,. Since as operators 
P(d) P(bx) =>. (6* P(b x)) P(a)/a! 
by Leibniz’ rule and this is a purely algebraic consequence of the com- 
mutation relations, it follows that (17.2.5) holds. 


The following is the crucial estimate in the proof of Theorem 17.2.1. 


Proposition 17.2.3. Let P(x, D) satisfy conditions (i) and (ii) above in a neigh- 
borhood of 0 and assume that (0,N)¢2Z. Then there is a neighborhood Xy>< X 
of 0 such that with $(x)=x,+x2/2 we have for small e>0 and large t>0 


(17.2.6) > Trim lal)—1 f [D* ul? e2t¢ dx 


SCJ|P,(ex, D)ul?e?**dx, ueC?(X). 
Proof. If we write v(x)=u(x) e"®™ then 
Du=e-**(D+itd’)v and Dv=e'*(D—it¢')u. 
Apart from the size of the constant, (17.2.6) is therefore equivalent to 


(17.2.6) >, qrim—|al)—1 f |D* v|? dx 


lal Sm 


SCf|P,(ex,D+itd¢’)v\?dx, veC%(X,). 


Assume first that the coefficients of P, are constant, thus P,=p. If we apply 
(17.2.4) with P=p and Q=21 @ it follows that 
(17.2.7) §\p(D—it ob’) v|? dx +21 | |p"(D—it ') vl? dx 

<f|p(D+it gv}? dx. 


By (17.2.3) and Parseval’s formula we have for all ve CP (R") 
(17.2.3) SY 20"— led (1 D* vl? dx < C(f |p(D —it N) v|? dx 


lal Sm 


+1? ([p”(D—itN) v|? dx) 
If ve CP(X 9) and |x|<6 in Xp, it follows from (17.2.3) that 


(17.2.8). 120" l2 f |D* yf? dx <2 C(f |p(D—it ¢') vl? dx 


lal Sm 


+17 | |p(D—it p’) vo? dx)+ C46? 17) Y 12" 1 12D [Dy]? dx. 


jal Sm 
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When 6 is small and t is large we have C’(1+6?17)<1?/2 which allows us 
to cancel the last sum against half of the left hand side. (17.2.6) is then a 
consequence of (17.2.8) and (17.2.7). 

To complete the proof we need an elementary lemma which allows us to 
handle variable coefficients. We denote the L? norm simply by || |. 


Lemma 17.2.4, Let X <IR" be an open set, and let A be a Lipschitz con- 
tinuous function with |A(x)— A(y)| S$ L|x—y| for x, yeX. Then 


[§ A(x<)(D% u(x) D? v(x) — D? u(x) D* v(x) dx|<|a+ B| LM 
if u, ve CO(X) and 
|D* ul] |D* ol] SM — when |a'+f'|<|a+ |, max(|e’|,|B'|)Smax((a, |B). 


Also the last inequality can be taken strict when |a|+|B]. 


Proof. This is obvious when «+ B=0. If |a+f|=1 we just have to note that 


§ A(x) (D ul) v() — u(x) D v(x) dx = — J D; A(x) u(x) v(x) dx. 
An integration by parts also gives the statement when |«|=|8|=1, 
J A(x)(D , u(x) D, v(x) — D, u(x) D v(x) dx = 
— f u(x)(DjA)Dyv@&) — DyA(x)Djv()) dx. 
These two identities allow us to exchange indices between « and f and 


transfer excess derivatives at a cost of LM for each index affected. 


End of Proof of Proposition 17.2.3. Writing P,(0,D)=p(D) and r(x, D) 
=P. (x,D)—p(D) now, we know by hypothesis that the coefficients of 
r(¢x,D) and their Lipschitz constants are O(e) in X. With the notation in 
the first part of the proof we form 


JIP,(ex, D+it $) vf? dx —f|P(ex, D—it¢’) v|? dx. 
Inserting P,=p-+r we first obtain the terms 
Jip(D+it $’) v|? dx —||p(D—it d’) vo? dx 227 f |p"(D—it o’) v]? dx. 


The other terms where no derivative falls on ¢’ are of the form 


com lel-18l f A(x)(D* v(x) DP v(x) — D? v(x) D* v(x) dx; lal Sm, |BlSm; 


where the Lipschitz constant of A is O(e). These terms can be estimated by 
means of Lemma 17.2.4. In addition there are terms of the form 


0” { A(x) D* v(x) D? v(x) dx; v+|al+|Bl|<2m, — |a| Sm, |B| Sm; 
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where sup|A|= O(e). Thus ; 
Pex, D—it gp’) vl]? +7 ||p(D—it p') ov}? 
S||P(ex,D+itg)v]?+Ce Yo 2 -l)-1 Dv? 
lol gm 
If we observe that (17.2.8) remains valid with p(D—it®¢’) replaced by 


P. (ex, D~—it@’) provided that e<6, we complete the proof of (17.2.6) just as 
in the constant coefficient case. 


Proof of Theorem 17.2.1. We recall that it suffices to prove that if 0EX 
and (0,N)¢éz, N=(0,...,0,1) then u=0O in a neighborhood of 0 if 
supp un {x;x, 20} {0}. In doing so we set u,(x)=u(ex) where ¢ is chosen 
so small that (17.2.6) is valid for a neighborhood X,cX/e of 0. Let 
xe CH(X_) be equal to 1 in a neighborhood V of 0, and set U=yu,. If 
P,(x, D)u= f then 
P,(ex,D)U=e" x(x) f(ex)+ > Dt xy Plex, D)u/a! 
O<|lal|<m 


which implies that P,(ex,D) UeL? and that, by (17.2.2), 
|P,(ex,D)UJSC Y em !\|D*U| in V. 


la]<m 
By the remarks at the end of Section 17.1 we have D* UeL? when |a|<m, so 
it is clear that (17.2.6) may be applied to U. If suppy is small enough we 
have @< —c for some c>0 in supp UN V. Hence we obtain using (17.2.6) 
tt YS et*D* UN SC |le* P(ex,D)UI SC’ YD le** D*U + C’e-™. 
lal<m lal <m 
For large t it follows that 
tt Y jle*D*UI S2C"e-*. 


lal<m 


Hence U=0 when ¢> —c, which proves the theorem. 


In the second order case the following lemma shows that the set 2 has a 
very simple description: 


Lemma 17.2.5. Let p be a quadratic form in IR" with complex coefficients 
which is elliptic, that is, p(€)+0 when 0+é€eER". If NeR"\O and 
€eR"~ RN, n+2, it follows that the equation p(€+1N)=0 has one root with 
Imzt>0 and one with Imt<0. When n=2 the roots are distinct unless p is the 
square of a linear form. 


Proof. R"\IRN is connected if n>2. Since p(€+7N) has no real zero if 
€eR"~RN it follows that the number of zeros with Imt>0 is independent 
of € Replacing by —é changes the sign of t also so there must be one 
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zero in each half plane. When n=2 there is a factorization p()=L,(€) L2(€) 
with linear factors L, and L,. They must be proportional if they have a 
common zero; and then they can be chosen equal. 


If m=2 it follows that Y is empty when n>2 and that Y= )( 0) for 
all x such that P,(x,é) is the square of a linear form when n=2. If X is 
connected and P,(x, &) is real for some x then 2 is empty, for the two zeros 
of P(x,€+tN) must remain in different half planes for reasons of con- 
tinuity. 

In what follows we shall only consider the second order case and shall 
then use the notation p(x,D) instead of P,(x,D). We shall prove that if u 
satisfies a weakened form of (17.2.2) and vanishes of infinite order at a point 
where the coefficients are real, then u is equal to 0. 


Theorem 17.2.6. Let p(x, D)= Y a(x) )D,D, be an elliptic operator in a con- 
nected neighborhood X of 0 such that ‘a (0) is real, ay, is continuous in X, 
Lipschitz continuous in X~ {0}, and lal S Clxf?- 1 for some 5>0. If 
D*ueL,,,, |u| <1, and 


(17.2.2) Ip(x,D)ul SCY |x\Pt'!-? |Deul, 
jaf S14 
(17.2.9) J |ul?dx=O(e%), &0, 
Ixl<e 


for every N, then u=0 in X. 


Proof. Since (17.2.2) implies (17.1.9) it follows from Corollary 17.1.4 in the 
extended form discussed at the end of Section 17.1 that for |«| $2 and all N 
(17.2.9)' J |D*ul?dx=O(e"), e090. 
e<|x|<2¢ 

Hence u is the sum of a function in A(X ) and a distribution with support 
at 0. However, no distribution with support at 0 is in I4,, so it follows that 
ue H55(X). By Theorem 17.2.1 it suffices to show that u=0O in a neigh- 
borhood of 0. Without restriction we may assume that p(0, D)= > Dj. 

As in the proof of Proposition 14.7.1 we introduce polar coordinates in 
IR" ~ {0} by writing x=e'@ where teIR and weS"~*. Then we have 


0/0x;=e~'(w,0/0t+Q)) 


where Q; is a vector field in S"~1. With the notation p(x, D)=¥' aq(x) D,D, 
it follows that 


p(x, D)= —e~*¥"a,,(e w)(w (0/dt — 1) + Q,)(w, 4/6t + Q,). 
With U(t,@)=u(e' w) the inequality (17.2.2) can be written 
(17.2.2)" |Ylay(e' o)(w,(6/dt— 1) + Q,)(w, 0/0t+ Q)UISC Y e*|U,| 


lelsi 
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where U,=(w0/dt+Q)*U. By assumption we have a,(e' w)=5,,+O0(e") as 
t-» — oo, first order derivatives are O(e*'), and 
¥ (@(G/6t —1) + Q,)(w, 4/6t + Q,) = 0 ot? + (n— 2) 0/6t+ YQ} 

since )'@,Q,=0 and )'Q,0,=)rdw,/6x,;= Yr 6(x,/r)/0x;=n—1. The oper- 

ator }'Q; is the Laplace-Beltrami operator A, in the unit sphere. The 

adjoint of Q, as an operator in [?(S"~*) is (n—1)w,—Q,. In fact, 

{(Q,u) vdx+fuQ,vdx=JQ,(uv)dx=J |x| O(uv)/@x,dx—J{w,d(uv)/orr"dwdr 
= —fojuvdx+n{ojuvr"—'dwdr 
=(n—1)[w,juvdx. 

In spite of this A,, is of course self-adjoint; indeed, we have 

Y (n—1)@,-2)? =(n-1)? -(-1) 98,04. YO? =F 07. 
In the proof of Theorem17.2.1 the essential estimate (17.2.7) was ob- 
tained from (17.2.4) thanks to the positivity of b,, that is, the convexity of 


the exponent ¢. To obtain a similar effect we introduce for some «¢ with 
0<é<06 a new variable T instead of t, 


t=T+e?; dt/dT=1+ee7>0. 


Note that T<t<T+1<T/2 if T<—2. After multiplication by (1+¢e*?)? 
the operator in the left-hand side of (17.2.2)” becomes 


Q=0/0T? +c(T)G/OT+(1teeTP F0?+ Yc, (Ta)(6/eT¥ A 


lal+js2 
Here c(T)=(n—2)(1+ee'7)—e7 e*7 /(1+ee*") is close to n—2 at — oo, and 
(17.2.10) C,,;=O0(e), dc, ;=O(e") as T> ~— ow. 


(Note that this change of variables is not smooth in the original variables.) 
We shail prove that for some T, 


(17.2.11) y 13-20 +led fF (a/OTY Q* Ul? e~ @*- 8? dw dT 


j+l|als2 
SC§flQU)?e"***dwdT, UeCH(-—~,T)xS"~’). 
(When |a|=2 we define Q* for example as a product 2,2, with j<k.) This 
will serve the same purpose as (17.2.6) did in the proof of Theorem 17.2.1. 
Proof of (17.2.11). Set U=e'™ V and 
Q,V=e""T Qe" V). 
Thus Q, is obtained from Q when 0/6T is replaced by 0/0T+1. Then 
(17.2.11) is equivalent to 
(17.2.11Y Y 37 2U+ led (1 (G/T OF Vi? eT dwdT 


i+ ol <2 


SC§flOQ,ViPdwdT, VeCe(—,T) x S"-'). 
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Let Q; be the operator obtained from Q, when 0/@T and Q, are replaced by 


—0/éT and —Q, while c, , is replaced by ¢,;. (With our present notation 
this is essentially equivalent to the complex conjugation in the proof of 
Proposition 17.2.3.) We shall examine the difference 


(17.2.12) §{1Q, VP. dwdT ff Or V\2dwdT. 


In addition to paying attention to the powers of t and orders of differen- 
tiation as in the proof of Proposition 17.2.3 we must now take the exponen- 
tial decrease at —0oo into account. It will follow from (17.2.10) that the 
terms involving c, , are not important, so we first consider the other terms 


in Q, and Q-, 
(40/0T +1)? +¢(T)(+0/6T +1) +(itee)? YQ}. 
The corresponding contribution to (17.2.12) is 
4Re((0?/OT? +77 +c(T) t+(1 tee?) ) Q5)V, (2t+0)AV/T) 
= —2Re(c'(T) 6V/@T, dV/8T)—2((3 17 +2et)c' V,V) +2) (hQ,V,2,V). 


Here we have used that )’,2,=0, and 
d 
h=—7(l tee?) (2t+c)=((2t+c) 267 e*? +(1 +e" )c’)(1+ee°7) 


22e*teF © 


when t is large enough. All other terms in (17.2.12) are of the form 


7! ff a(T, @)((8/8T)* 0 V (8/6T)"° OV 
—((—0/8T)o(— a V (—/8T)°(— QV) dw dT 


with j+ootlal+Bot+lBIS4, ot+]o/<2, Bo+|B]S2 and a=O(e*"), a’ 
= O(e*"). We can estimate them by an obvious modification of the proof of 
Lemma 17.2.4, which can also be used directly after decomposition by an 
appropriate partition of unity. (Recall that the adjoint of Q, differs from 
—Q, by an operator of order 0.) Hence we obtain 


(17.2.13) QF Vil? +4e? ty ff]Q,V\2 ee? dwdT S||0,V ||? 
+C SY 13> 2U+led (0 a/ATY Ot VI? (eo? +17! e°7) dw dT. 


it+lals2 


This is an adequate substitute for (17.2.7). Instead of (17.2.8) we shall prove 
that for some T, 


(17.2.14) Yt 2G+leb FF a/ATY O* VI? eT dwdT 


j+lels2 
<C§flQ, VI? +107 VI? +27 510, Ve"? daodT, 
Ve C2 ((— 00, To) x S"~ 1), 


17.2. Unique Continuation Theorems 17 


if t is large enough. If we introduce Ve*?/? as a new dependent variable, 


freeze the coefficients at — oo, and drop terms of lower order, we find that it 
is sufficient to prove that 


(17.2.15) St 2G+ led (( (a/ATY Q* Vi? dwdT 


J+ lal $2 
< Cif (\(6/0T +1)? + A,,) VI? +|((0/6T — 1)? + 4,,) VI? 
+17) 2; V |?) dw dT. 


The integral in the right-hand side is equal to 
ff (2\(67/0T? +27 +. A,) VI? +82? |OV/OT|? +127 )[Q, V|?)dwdT. 
Furthermore 


(6? /OT? +17 +A) Vil? = |0? V/OT? |? +24 VII? + 4, VII? 
+2 |0/ATQ,VI?-27°O 2; V||?+ eV /eT ||). 
By the ellipticity of 4,, we have 
| d [AVP SCl4.VI2+ Yo ary?) 
ja] = 2 Ja} sa 
If we combine these estimates we obtain (17.2.15), hence (17.2.14). Using 
(17.2.13) to estimate the right-hand side of (17.2.14) we obtain when 7 is 
large enough 
y 4 - 20+ lal it} \(a/eTY Q* Vi? (et — Cet +17 1 eT )/e?)dwdT 
i+lals2 
S$ Cr/2e |Q,V|\?. 
When 7 is sufficiently large and e?° is sufficiently small, the estimate 
(17.2.11)' follows. 


End of Proof of Theorem 17.2.6. First recall that the function u in the 
theorem satisfies the differential inequality (17.2.2)” when considered as a 
function U of w and t. When we take t=T+e*? the inequality is replaced 
by 


(17.2.2) joulsc ¥ &TU, 


jal $4 


where U,=|U| and U,=|éU/0T|+|Q*U| when |a|=1. Choose weC*(R) 
equal to 1 in (— 00, T, —1) and 0 in (Tp, 00), and set 


U"(T, o)=/(T) U(T,@). 


It follows from (17.2.9) that the derivatives of UY of order <2 multiplied by 
e-*T are in L? for any N. By cutting u off for large negative T and 


regularizing we conclude that (17.2.11) is valid for UY. The right-hand side 
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can then be estimated by 


C' (e728 Fo- Dy 3 ff [UY e- 27-9 dwdT) 
laj 1 


where U¥=|U"| and U¥ =|dU%/dT|+|Q*U% if |a|=1. Hence (17.2.11) gives 
for large t, since S26, 


S29 -2U+ led (f |(G/ATY Q*UYP e~ 2-9 dwdT S2C"e~**(Ty— 1). 


i+ lal $2 


When t- 00 it follows that U=0 when T<T,—1. Thus the function u in 
Theorem 17.2.6 vanishes in a neighborhood of 0. The proof is complete. 


As we saw in Chapters X and XIII a major application of uniqueness 
theorems is the proof of global existence theorems. We shall give another 
example here using Theorem 17.2.1. 


Theorem 17.2.7. Let a,,(x) be Lipschitz continuous in an open set X <R’ (or a 
C? manifold), Gj = Ay, and assume that (Re aj(x)) is positive definite. Then 
(17.2.16) YD lay.D.4) =f 


umEles ueH(X) if ueL>,(X), which makes the equation defined, and 


tf €Li.<(X). Moreover, the equation has a_ solution ueH35(X) for every 
fe Lyoe(X). 
Proof. 1) To prove the regularity statement we first show that 

ueL,.(X), YD (a;,D,w eH (X) > ue Hi5(X). 


‘loc 
This statement has the advantage that the hypotheses remain valid if we 
replace u by yu where ye C>(X). In the proof we may therefore assume that 
ueé'(X). Let K cX be a compact neighborhood of supp u. If ve CP (K) then 


(17.2.17) YJ iDjvl? dx CRef Yay(x)D,vD,v dx 
=CRef )\(D,a;,(x) D,v)idx. 


As at the a of Section 17.1 we choose v=u,=u*o, where PEC%, | pdx 
=1 and ¢,(x)=(x/e)e~". By Lemma 17.1.5 we ‘know that 


a;,D,u,—(a;,D,u) * >, 
is bounded in I? when 0. Hence 
f.=¥Dj(aj,D,u.) =, (Djaj,D,u) *$,+ YD a,D,u,-—( aj,D,u) * &,) 
is bounded in H,_,). By (17.2.17) it follows that 
ugltyS C1 YD jwell? S Coll Filla llelleay 


which implies that |ju,|4,)S C2] fll(_1) 1s bounded. Thus ueH,,). If fel it 
follows as we saw at the end of Section 17.1 that ueH,). 
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2) To construct u it is enough to show that 
(17.2.18) (LPISIM DY DjG,D. Plz, beCo(X) 


for some positive continuous function M. In fact, by the Hahn-Banach 
theorem it follows then that for some gel? 


(f,¢)=(g,M > D,a;,D,¢), gece (X), 


which means that u= Mg satisfies the equation (17.2.16); by the first part of 
the proof u is in H/3. If K is a compact subset of X we also know from the 
first part of the proof that 


Ilan Cll YD; G.Dib|l— 1), peC3(K) 


which of course implies (17.2.18) if M>C fll) and supp @cK. As in 
the proof of Theorem 10.7.8 the only problem is to increase the compact set 
K without increasing M much on a somewhat smaller set. Let K,,K2,... be 
a sequence of compact sets with union X, each contained in the interior of 
the following one and chosen so that X ~K, has no component which is 
precompact in X. Let M be a function such that (17.2.18) holds when 
~eCo(K,) for some j>2, and let e>0. Then we claim that (17.2.18) remains 
valid when @e€Co(K;41) for some M such that M=(i+e)M in Kj_2. 
Taking a sequence ¢; with ake +8;)<0o we conclude from this that there is 
a function M such that (17.2.18) is valid. 

If the claim were false then (17.2.18) would be false with M2>(1+e)M 
everywhere and M very large on [K j-1. 80 we can find a sequence 
bye Co (K+) with 


(féoy=l, (+9IMYD4,D,oyll2S1, 
ID Dap DP. Onllrex,- ys 1/N. 


By the first part of the proof the sequence dy remains bounded in H,,) as 
N- oo so it has a subsequence converging in H,,, to a limit & with 


(f,®)=1, (1t+e)||M>D aD, Pll,2S1, 
supp@cK,,;, }Da,D,P=0 in §K,, 

By hypothesis every component of X\K,_, contains points outside the 
compact set K,,,. Hence it follows from Theorem 17.2.1 that 6=0 there, so 
supp@c K,_,. By the first part of the proof ¢H,,). If we regularize 6 we 
obtain functions in Cp(K,) violating the assumption that (17.2.18) is valid in 
Co(K,), which completes the proof. 


We shall now prove an extension of Theorem14.7.2 which gives a 
uniqueness theorem for a perturbation of the Laplacean at infinity rather 
than at 0. After passage to polar coordinates the problem will be very close 
to that discussed in Theorem 17.2.6. We assume now that a,, are Lipschitz 
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continuous and that 
(17.2.19) la(x)—5 1S CAx|**%, — \ai,(x)|S C/|x|?*? 


in a neighborhood of infinity. Thus p(x,D)=) a x(x) D ,D,, approaches minus 
the Laplacean at infinity; any other homogeneous elliptic operator of sec- 
ond order with real coefficients could of course be used as well. 


Theorem 17.2.8. Let X be a connected neighborhood of oo in IR" where 
(17.2.19) is valid, and let (1+|x|)'D*ueL7(X) for all t when |a|<1. If A>0 
and 


(17.2.20) [p(x,D)u—Au|<C|x|-! ¥ |Dtul, xe X, 
Jaf 1 


it follows that u=0. 


Proof. As in the proof of Theorem 17.2.6 we introduce polar coordinates 
x =e'w where teR and weS"~', and obtain with U(t, a) =u(e'w) 
(17.2.20! |}. aj,(e'w)(w (0/At — 1) + Q)(w,4/dt + Q,) U + 2e7*U| 
Sc y eo 
lal <1 
where Ug = (wO/dt + 2)°U. By assumption we have ay(e'w) = 5%, + O(e7'-*) 
as tf ~—» +00, first order derivatives are also O(e~'~*), and at infinity the sum is 
(67/6t? +(n—2)0/6t+A,)U 


as before. Since we work with large positive t now we set t= T—e~*? for 
some «€(0,6), which is legitimate since dt/dT =1+«ee-*7 >0. When t>0 we 
have T>0, hence |t— T|<1. After multiplication by (dt/dT)* the operator in 
the left-hand side of (17.2.20)' becomes 


Q=07/0T? +c(T)d/OT+(1+ee-7PA,+ Yc, {Tw)(@/OTYQ*+A(T) 


lal+js2 
where c, ;= O(e~7~°"), de, ;=O(e~7~°"), and 
c(T)=(n—2)(1+ee-*7) +67e" 7/1 +ee-*), 
A(T) = Ae77 e- 2" "(14+ ee-*7)?. 
Note that 4’'(T)2 Ae?" for large T: We shall prove that for large T, and t 
(17.2.21) eM +4-l) De UI]? 


Jal s1 


SCle“77 +2-ile"* QUI, UECH(T, 00) x S"~’). 


Here D* denotes any product of |a| factors 2; and 0/0T. In view of Fried- 
richs’ lemma it follows as in the proof of Theorem 14.7.2 that (17.2.21) may 
be applied to the function U in (17.2.20) multiplied by a cutoff function of 
T which is 0 for T<7,+1 and 1 for T>T,+2. When T, is chosen large 
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enough we obtain as before when too that U=0 when T>T,+2; thus u 
=0 in X by Theorem17.2.1. What remains is therefore to prove the es- 
timate (17.2.21). 
Set V=Ue"? and 
Q,V =e" Q(Ve-*7) 


which is the operator obtained when 6/0T is replaced by d/OT—t in Q. 
Then (17.2.21) is a consequence of the estimate 


(17.2.21Y > |(r+et)~'*lpev\\? 


lal $1 


SCle-T? +c-\1O, V7, VECo(To, 0) x S"~"), 


apart from the size of the constant. Denoting by Q;, the operator obtained 
from Q, when 0/0T and Q, are replaced by —0/0T and —Q, while c, ; is 


replaced by c, ;, we shall again consider the difference |Q,V||?—|Q7 V||? in 
order to prove the following analogue of (17.2.13) 


(17.2.22) Or VI? +4e?ty flew" 0,V |? +2 Alle™V ||? S10, VI? + CR(V), 

where VEC®((Ty, 00) x S"~ +), Ty is large, and 

(17.2.23) RWV)= ¥ lle"TD*V|2/c+ ¥ (e727 D*V 274 ||D*V|I?)r2- 24, 
Jal = 2 lal s 


al S 
First assume that all c, , vanish. Then Q,=L,+L,, Q7 = L,—L, where 
L,=0?/0T? +17 -c(T)t+(lt+ee- "7A, +A(T); L,=—(2t-c(T))0/OT. 
L, is symmetric and the adjoint of L, is —L.—c’(T). Hence 
|,Vil?— lO; VI? =2(L, YL, V)+2(L, YL,V)=2(Ly, L214 V) 
—2(c'L,V,V). 
Computing the commutator [L,,L,] and using the definition of L, we find 
that the right-hand side can be estimated by CR(V) apart from the terms 
(—4te7e"*"(1+ee-*)A, VV) +(Qt—c(T)) A(T) —2¢' A(T))V, V). 


Since 1'(T)2Ae?? for large T we obtain (17.2.22) in this case even with 4 
replaced by 3//2 say in the last term on the left. If the coefficients c, , are 
not all 0 then we have three other types of terms to consider: 


a) The crossproducts involving 4(T) and c, ; are 
2Re Y'((c,, (0/8T— 19 WV, A(T) V)—(A(T)V, c,,;( — 6/6T— tP(— Q)V)). 


After an integration by parts as in Lemma 17.2.4 we find that these can be 
estimated by 


Cife™ (V2 + ¥ [D*V||V/)dwdT S|e™VI2+C Y, 2-21 D* VI? 
laf=1 Jaf s1 
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The last sum can be estimated by means of R and the preceding term can 
be cancelled against the extra At||e7V||?/2 which we had in the left-hand 
side in the “unperturbed case”. 

b) After a similar integration by parts, crossproducts between c, , and 
some other term than A(T) in the unperturbed operator can be estimated by 


CY fe +9 iD V| [DP V| dwdT 


where j+jo/+[f|$3 and |a|<2, |B|S<2 in the sum. If |«|/=2 we can write the 
integrand as t~1/?|e~TD*V|1/+1/2 |e-®T DFV|, where j+|B|<1, so the in- 
tegral can be estimated by CR(V). The same is obviously true when {| =2. 
If jajsi and {[f~[S1 we can estimate the integrand by 
tti—le—4T |p| 71—lBle—8T IDF VY], for 5 may be assumed <1, and the 
integral can therefore again be estimated by CR(V). 

c) Terms containing two coefficients c,, are smaller for large T than 
those examined in b) so the same estimates hold for them. This completes 
the proof of (17.2.22). 

Next we shall estimate the T derivative of V which is missing in the left- 
hand side of (17.2.22). To do so we observe that 


(Q.—O, )V/2=L,V= —(2t —¢(T)) 0V/6T 


when all Cy,j are equal to 0; otherwise terms which can be estimated by 
¥ e774 +972—lel/D*V| may occur. Hence 


C|aV/AT |? S10, VI2+ Or VIP+C Y fe“ T+9 DVI) 4- 2H 
lal 32 


If we divide by t and add to (17.2.22) we obtain, with a new constant of 
course, 
(17.2.24) et} lle*77 QV |? +7 OV/T ||? +Ac|e7 V2 SC(|O,VII7+RV)). 
Finally we shall prove an analogue of (17.2.3). It is complicated by the 
fact that Q, contains the exponentially large term 4(T) which has to be cut 
down to a size which can be controlled by the term At||e7V||? in (17.2.24). 
In the following lemma the right-hand side contains also the other terms 
from the left-hand side of (17.2.24), with A playing the role of e?. The left- 
hand side contains what is needed to introduce variable coefficients, es- 
timate the essential contribution to the error term R in (17.2.24), and finish 
the proof of (17.2.21). We denote by A’ the Laplace operator in x’ 
=(Xj,40%,.,)ER™*: 


Lemma 17.2.9. For ue C@(IR") and large positive A and t we have 
(17.2.25) Ku* 9) cf 2 Deul? +47 °c? ul? +MY) ||Dtul|2<?~ 74! 
ja] $1 


lei s2 


n—1 
<C (K-11(4'+(@/0x4—2)?)ul? +24  IDju}? +24? 4ul?) 
provided that : 
(17.2.26) A-*t?<73/K+42, 13/KSA~*t?+A2,  M?K<tA*, MSA?-#?, 
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Proof. By Parseval’s formula the estimate is equivalent to one of the form 


(172.25) ((€|?+22)?/K + A~*t? + M(lé? +27) 
SCE)? 2? + 2itE,)2/K + 1A 16/2 +72). 


We distinguish two cases: 
a) If |[@/c|? -1]21/2 or [E,/t| 21/2 then 


(EP? + 27? — 2? + 2it d= (e/a? + DE/e)? —1 + 2i¢,/7| 


is bounded so the estimate of the first term in (17.2.25) is obvious. The 
estimate of the second term follows since A~*t?<1t*/K+71A’, and that of 
the third term follows from the inequality between geometric and arithmetic 
means since M? <1 A2/K. 
b) If [|é/t|? —1/S1/2 and |€,/t]<1/2 then |¢|?=1?/2 and the estimate is 
equivalent to 
tIK+ At? +M1?<C(13A7§4+7A?). 


The estimate of the first term follows from the second inequality (17.2.26), 
that of the second term is trivial, and the estimate of the third term follows 
from the last condition (17.2.26). The proof is complete. 


Proof of Theorem 17.2.8 continued. If we take w,,...,@,_, a8 coordinates on 
the unit sphere in a neighborhood of (0,...,0,1) then Q, differs from 
Ai, +(6/8T —1)? by A(T), by an operator with coefficients O(|w’|+e7*7), and 
by an operator of first order in t and D. If 1/e<|e7/A|<e, T2Tp, and |w’| 
is small enough in supp V, it follows from (17.2.25) that 
(17.2.27) ¥ |K~?D V2 +03 eo? V7 +MY |D*V 227-74 

Jal =2 la] $1 


SC(|K~*7Q,V\? +t) le*7? QV ||? +2 Je” VI”) 
if in addition to (17.2.26) we have 
(17.2.28) A/K<t 


which implies that the term A(T)V in Q,V can be absorbed in the last term. 
For the proof we just have to observe that the other perturbing terms in Q, 
can be cancelled against half of the first sum in the left-hand side of 
(17.2.25) with |a|<2 still. To satisfy (17.2.28) and respect the lower bounds in 
(17.2.26) with K as small as possible we choose 


K=A*t if A?*+*<1?, 
(17.2.29) K=13/A? if A?<t?SA?+s 
K=A*/t — if 1?<A?. 
The first two inequalities (17.2.26) are then fulfilled, and tA?/K=A?~* in 


the first case, tA?/K = A*/t? > A?-* in the second case, while 1A?/K =1? in 
the third case. The remaining conditions are therefore satisfied if 
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MsSmin(A!-*?,1), We take M=(e-7’2+1/z)-* in what follows. Since 
K2A in all cases we can replace K~‘/* by e~7/? in the right-hand side of 
(17.2.27), and since K~\/?>MA~11-1/? by (17.2.26) we replace K~1/? by 
Me~™71~/? in the left-hand side. This gives with a new constant 


(17.2.27" MC Y, Je-?D*V 2/24 Y [D*V 222-2!) 4.23 ye-e7/2 VI? 
Ja{=2 


lel $1 


S$ Cle“ O, VI? +2¥ le 7? QV? +7 ]/e7 VII”). 


The point (0,...,0,1) on the unit sphere has no special properties. We 
can therefore choose a partition of unity 1=)'¢; on the unit sphere and a 
partition of unity 1=)'y(T—k) on R such that suppyc(—1,1) and 
(17.2.27) is applicable to V,,=@(w)W(T—k)V(Tw) for all j and k when 
VeEC®((Tp, 00) x S"—*). If we sum over j and k we find that (17.2.27)' is valid 
for V itself with a larger C. In the left-hand side we just use that at most a 
fixed number of supports can overlap, and in the right-hand side we use the 
later terms in (17.2.27) to take care of terms where a derivative falls on ¢, 
or on w. 

Fix ¢ now so that 0<e<6. Since 


RV)S ¥ le TD*V|7?/c4+ Y |D*V 227-741 
Ja] = 2 Jo[S1 


+ C, eft 2)To Y je 27/2 Dt V [223-2 I4), 


lals1 
it follows from (17.2.27) if T, and t are large enough that the term CR(V) 
in (17.2.24) is less than ||Q,V||? plus one half of the left hand side. Thus 
ety le? OVI? +Atie7 VI? $2(C +1) 10, V |. 


If this estimate is used in the right-hand side of (17.2.27)' we obtain (17.2.21)’ 
which completes the proof. 


17.3. The Dirichlet Problem 


In the study of subharmonic functions in Section16.1 we discussed the 
Dirichlet problem for the Laplacean in a half space. We shall continue the 
study here, adding L? estimates and some variable coefficient theory. This 
will allow us to sketch with a minimum of technicalities various methods 
which have been used in the study of general boundary problems for elliptic 
differential equations but which will not be covered by this book. 

First we shall just study the Dirichlet problem for the Laplacean in 


R"_ ={xelR"; x, >0}. 
It consists in finding for given f and ¢ a solution u of 


(17.3.1) Au=f in R*; u=¢ on OR". 
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If @ is defined in R" and sufficiently smooth, then replacing u—@ by u and 
f—A4A@ by f reduces (17.3.1) to the Dirichlet problem with homogeneous 
boundary data 


(17.3.1Y Au=f in RR"; u=0 in OR", 
and we shall study it in that form. 


We recall from Appendix B the notation H.(R”) for the set of re- 
strictions to IR" of functions in H,,)(R”). By Corollary B.2.5 we have 


(17.3.2) A(R") = {uel (R",); DtueL(R“), |a| = 1}. 


It follows from Theorem B.2.7 that every ueH,,,(R ") can be considered as a 
bounded continuous function of x,¢R, with values in Haake 1). If uy 
=u when x,20 and u,=0 when x, <0 then u,€l7(R") and 


0Up/Ox, = u(.,0)@d(x,)+(Cu/Ox,)o,  Oup/Ox,;=(Cu/Ox)y if jn. 


Thus u)¢H,,,(IR") if and only if u(.,0)=0. 
The following is an analogue of Theorem 13.2.1, and the proof is similar. 


Theorem 17.3.1. Let P(x,D)= ¥ a,(x)D* have continuous coefficients in a 


lal $2 


neighborhood of 0, and assume that p(é)= ae a,(O)E* is elliptic with real 


lal = 
coefficients. If X is a sufficiently small iélehborhood of 0 in R” and X, 
=XOR",, Xy>=X OR", there is a linear operator E in P(X ,) such that 


(17.3.3) P(x,D)Ef=f, Ef=0 inX, if fel?(X,); 


(17.3.4) EP(x, D)u, =u, if u, is the restriction to X , 
of some ue C>(X) with u=0 in Xo; 


(17.3.5) D*E is a bounded linear operator in L?(X ,) if |u| <2. 


By (17.3.5) we have D*Efel(X,) for jal|S2 if fel?(X,). Thus 
XYEfeH,,(R*.) for every ~eCg(X), so boundary values of Ef are defined in 
6% 


Proof. We may assume that p(é)=|é|* for this can be achieved by a 
linear change of variables respecting the boundary plane. Let e be a funda- 
mental solution of p(D) which is even in x,, for example the Newton kernel 
(Theorem 3.3.2). If fel?(X ,) we define Tf =f in X ,, Tf(x’, —x,)= —f(x',x,) 
if (x',x,)eEX,, and Tf =0 elsewhere. Then TfeL%,,,,(IR") if X is bounded, as 
we assume, and Tf is odd as a function of x,. Hence e * Tf EH(35(R’) is also 
odd in x, so the restriction to the plane x,=0 must vanish. Let E, f be the 
restriction to X ,. Then 


(17.3.3)  p(D)E,f=fin X,, E,f=0 in X, if fel?(X,); 
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(17.3.4y Eop(D)u, =u. if u, is the restriction to X + 
of some ueCP(X) with u=0 in X,; 
(17.3.5) D*E, is bounded in [7(X ,) if |a|<2. 


We have already verified (17.3.3). To prove (17.3.4) we first observe that 
Tu, is continuous. Hence D,Tu, is the even extension of D,u,, which is 
continuous, and D? Tu, is the odd extension TD?u, of D?u,. This means 
that Tp(D)u, =p(D)Tu, so e* Tp(D)u, =e*p(D) Tu, =Tu, which proves 
(17.3.4). (Here it is important that p(D) has no term which is odd in D,.) 
Condition (17.3.5) follows from Theorem 10.3.1 for example. 

Now we just copy the proof of Theorem 13.2.1. Writing 


P(x,D)=p(D)+ ¥ b,(x)D*, 
lah S2 


where b,(0)=0 when |a|=2, we look for a solution of the equation 
P(x, D)u=p(D)ut ¥ b,(x) Dtu= fel (X ,) 


which is of the form u=E,g, gel?(X ,). By (17.3.3) this guarantees the 
Dirichlet condition for u, so we only have to solve the equation 


gtAg=f; Ag=)b,D*Eog. 


The operator A in 1?(X ,) has norm <4 if )° sup|b,| is small enough. Then 
we define 
E=E,(I+4)-! 


and deduce (17.3.3)-(17.3.5) from (17.3.3)'-(17.3.5) exactly as in the proof of 
Theorem 13.2.1. If the coefficients b, are not small we can apply this con- 
clusion with b, replaced by b,(ex)e?~'*! if ¢ is small enough. Taking ex as a 
new variable we then obtain Theorem 17.3.1 with X replaced by eX. The 
proof is complete. 


Remarks. 1. If a,eC® we can adapt the proof of Theorem 13.3.3 to show 
that there is a linear map E: 7 (R"_)—> Hy) (R”) such that 


P(x,D)Ef=fin X,, Ef=0 in X, if fel (R"); 
EP(x, D)u, =u, in X, if u, is the restriction to X, 
of some ueCZ(X) with u=0 in Xq; 
D°Efe?(R"), |al<s+2, if Dtfe?(R"), lalss. 
Here s is any integer 20. 

2. In Theorem 17.3.1 we assumed p(D) real just to have a simple explicit 
solution of the constant coefficient Dirichlet problem (17.3.3)' given by the 
reflection argument. However, this is by no means essential. If n>2 or n=2 
and the zeros of p(1,t) are in opposite half planes Imt20 then we can 


easily construct E, by taking Fourier transforms with respect to x’ and 
solving an ordinary differential equation. The same approach is in fact 
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applicable to elliptic operators or systems of arbitrary order with quite 
general boundary conditions. To obtain a sufficiently general framework for 
perturbation arguments one should then start by solving the constant coef- 
ficient boundary problem with inhomogeneous boundary data too. This 
approach was used systematically in Chapter X of “Linear partial differen- 
tial operators”, but in the study of boundary probiems in Chapter XX 
below we shall use another more constructive method. 


(17.3.3) and (17.3.5) give a local existence theorem for solutions of the 
Dirichlet problem (17.3.1) such that D*uel?, |a|<2, if feZ?. One can pass 
to a global existence theorem with the methods used in the proof of 
Theorem 17.2.7 for example. We shall not do so here but show instead how 
to use (17.3.4) and (17.3.5) to prove a regularity theorem analogous to one 
for interior regularity at the end of Section 17.1. 


Theorem 17.3.2. Assume in addition to the hypotheses in Theorem17.3.1 that 
a, is Lipschitz continuous when |«|=2. If D*ueL?(X ,), |a| <1, and 


P(x,D)u=feLl(X,), u=0 in Xo, 
it follows that D*ueL?(Y,), |x| <2, for every YE X, and that 


(17.3.6) ¥ [D*ullpay.)S Cr(IP Os Py ultra » {| D* ull pax.) 
alsi 


Jal =2 


Proof. Choose yeC?(X) equal to 1 in Y¥, and set v=yu. Then P(x, D)v 
=geL?(X,), D'vel?(X,) when |a/<1, v=0 in Xo, and v=0 outside a 
compact subset of X. Choose ¢eCH(R"~*) with 620, | o(x’)dx’ =1, and set 
v,(x)=J v(x'—ey’, x,) d(y’)dy. 
By Minkowski’s inequality we have with || || denoting the norm in 1?(X ,) 
|D*v,||S||D* vl], |a| S11, 


and D%v,e1?(X ,) if D* has at most one factor D,, for we can let the others 
act on ¢. Since P(x,D) can be divided by the coefficient c of D?, we may 
assume that c=1. Then 


P(x,D)v,-g,= >, (a,D*v,—(a, D*v),). 
an< 2 . 


When |«|=2 we can write D*=D,D, for some jn and apply Friedrichs’ 
lemma (Lemma 17.1.5) for every fixed x,. This gives 


| P(x, D)v,—g,|| 70, «0. 


In particular it follows that D? v,eL?(X ,). 
By (17.3.4) and (17.3.5) we have 


(17.3.6)' y) |Dtw, | SCI P(x, D)w, | 


jal $2 
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if w, is the restriction to X, of a function weC?(X) vanishing in XQ. If 
Tv, is the odd continuation of v, from IR", to R" then D*Tv,eL? if «,<2 
(see the proof of Theorem 17.3.1). If we apply (17.3.6)' to the regularizations 
w of Tv, by convolution with even functions, it follows at once that (17.3.6) 
can be applied to w, = v-. When € — 0 we conclude that D%v € L?, |a|<2, and 
that (17.3.6) is valid. The proof is complete. 


The Dirichlet problem can be solved with great ease and in great 
generality by means of Dirichlet’s principle. Consider a differential operator 
of divergence form 

P(x, D)u= D(a, D,,u) 


where (a,(x)) is a real positive definite symmetric matrix which is a con- 
tinuous function of x in the closure of an open bounded subset X of R". If 
P(x, D)u=f in X the equation is equivalent to the weak form 


(17.3.7) (f,v)= [fodx = [ axDyuDjudx, v € CE°(X). 


This condition makes Sense if D*ueI?(X), |a| <1, and remains valid then for 
all v in the closure H of C3(X) in A,,,(QR”). If yeCPdR") it is clear that 
xu € A for every u € H. If AX € C! at x9 € OX we can choose a C! map of a 
neighborhood of 0 in R" on a neighborhood X, of x, mapping R" to X 
and conclude if yeCP(X,) that UV (WeH (R") and that yt) = 0 on 
aR", . Conversely, if OXeC ' and this condition is fulfilled at every boundary 
point then ueH. (We could also identify H with the space A(X) of 
distributions in H,,,(R”) with support in X, for no such distributions have 
support in 0X.) The condition ueH is therefore a generalization to an 
arbitrary domain of our previous statement of the homogeneous Dirichlet 
condition. The Dirichlet problem can thus be restated as follows: Find ueH 
satisfying (17.3.7) for all veH. This is Dirichlet’s principle. 
The solution is extremely simple. Write 


Olu, v)=)ay,DjuD,v dx; u, ve C%(X); 


which is a non-negative hermitian symmetric form. With || || denoting the 
L? norm we have 


YDy4l?SCQu,u), ueCH(X). 
The left-hand side is equivalent to |u| oO since for example 
flul?dx= —2Refx,udi/dx, dx <4 \ul?dx+2[|x,D, ul? dx 
and this implies 
ull? S4 lx, Dy ull. 


Hence (Q(u, u))? is a norm equivalent to ||ul| a) on Co(X) and therefore on 
H. For every fe12(X) we have 


(LMSISI lol SC QW,  veH, 
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which proves that there is a unique uéH satisfying (17.3.7) for every veH. 
Taking veC? we obtain P(x,D)u=/f, and ueH means that the homo- 
geneous Dirichlet boundary conditions are fulfilled. We can strengthen the 
conclusion if 6Xe¢C? and a,, is Lipschitz continuous. In fact, at any bound- 
ary point we can then take a C? diffeomorphism yw as above which flattens 
out the boundary in a neighborhood. Using Theorem17.3.2 we obtain 
D* uel? (X) when |a|<2, and 


>, D* ull pay SC IWF We2a0- 
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If ay, f and 0X are smooth it is not hard to deduce that u is also smooth. 


We shall finally outline another classical method for solving the Dirich- 
let problem and indicate how it will be modified in Chapter XX. For a 
change we shall now emphasize the role of the boundary data. Thus we 
assume that X CIR” is bounded and that 0X¢C%”, and we consider the 
Dirichlet problem for the homogeneous Laplace equation 


(17.3.8) Au=0 in X, u=@ on OX. 
If X were a half space, then the solution given in Section 16.1 would be 


ue= 2] (y) OE(x — y)/6n, dS(y) 


where E is the fundamental solution given by the Newton kernel (Theo- 
rem 3.3.2), n is the exterior unit normal and dS the surface area on 0X. This 
is just another way of stating the reflection method used to prove Theo- 
rem 17.3.1. In the general case one therefore tries to find y so that the 
double layer potential 


u(x)=2 | W(y) 0E(x— y)/én, dS(y) 


will satisfy the boundary condition in (17.3.8); u is automatically harmonic 
in X. This gives an equation of the form 


w+Ky=¢ 
where K is a compact (Fredholm) integral operator. The reason for this is 
that K would be 0 if the boundary were flat so K just expresses the 
deviation from that case. Fredholm theory was developed precisely to solve 
the preceding equation. 

If instead of the Dirichlet problem one is interested in the Neumann 
problem, that is, the boundary condition du/dn=@, one obtains in the half 
space case the solution 

u(x)= —2] b(y) E(x—y)dS(y). 
In the general case one therefore tries a simple layer potential 


u(x)= —2] w(y) E(x —y)dS(y), 
and the problem again becomes a Fredholm equation y+Kyw=¢. 
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To carry these arguments over to general operators and boundary con- 
ditions is rather cumbersome. First one must solve constant coefficient 
problems in all tangential half spaces to find appropriate kernels for the 
problem at hand. Then one makes an “ansatz” as above and has to show 
that it leads to a Fredholm equation. However, there is a useful modifica- 
tion of this approach. By Green’s formula 


(17.3.9) u(x)= | u(y) dE(x—y)/dn,dS— | u,(y) E(x—y)dS 
ax ax 


if uy and u, are the boundary values and the normal derivative of u 
respectively. Thus we know u if uw) and u, are known. The formula (17.3.9) 
defines a harmonic function for arbitrary uy and u, but it need not have 
boundary values and normal derivative equal to uy and u,. Indeed, we 
know that a harmonic function is determined by its boundary values so u, 
=Au, for some operator A if uy and u, are boundary values and normal 
derivatives of a harmonic function. The operator calculus which we shall 
develop in Chapter XVIII will give a quite explicit representation of oper- 
ators such as A. By using the differential equation 4u=0 we can write any 
differential boundary condition in the form Byu,+B,u,=@ where By and 
B, are differential operators in 0X. Thus the solution of the boundary 
problem is reduced to solving the equation (B)+B,A)uj=@ in the ma- 
nifold without boundary 0X. The operator By+B,A is not a differential 
operator but it belongs to a closely related class of operators to which the 
theory of elliptic differential operators, for example, is easily extended. 

These remarks are admittedly and purposely vague. We just hope to 
convince the sceptical reader that there are good reasons for the introduc- 
tion of a fairly large machinery in Chapter XVIIJ, and that the effort it 
requires will be rewarded when we return to boundary problems in Chapter 
XX. 
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We have seen in Chapter XIII and again in Sections 17.1 and 17.3 how to 
extend results on constant coefficient elliptic operators to variable coef- 
ficient ones. However, this gives only the existence of fundamental solutions 
and information on their continuity as operators. One is often interested in 
the precise form of their singularities. We shall therefore present a re- 
markably simple and precise construction due to J. Hadamard, which gives 
the singularities of the fundamental solution with any desired precision. It is 
also applicable in non-elliptic cases where the methods of Chapter XIII and 
Sections 17.1, 17.3 fail completely. In the second half of the section we shall 
indicate this in a special case needed in Section 17.5. The extension to 
general second order hyperbolic equations should cause the reader no 
difficulty. (See also the notes to Chapter XXIII.) 
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Let P be a second order differential operator of the form 
P=—Y d/dx,(gi* 6/Ox,)+ Db’ d/Ox,+¢ 


where g/*, b/, c are C® functions in an open set X cIR" and (g/*) is a real 
positive definite matrix. We wish to construct a right parametrix for P, or 
rather for P—z when zeCR,, since this will eliminate some irrelevant 
difficulties in the constant coefficient case. First we assume that P is equal 
to minus the Laplacean A, 


A=¥,07/ax2. 


The inverse Fourier transform of (|é|?—z)~1 is a fundamental solution of 


—A-—z. We shall also introduce the powers, for these occur inevitably when 
one makes a perturbation (cf. Section 12.5). Thus we set (in the distribution 
sense) for v=0,1,... 

(17.4.1) F(x)=v!(2a)y-" f bP (El? —z)-" dE. 

It follows from Theorem7.1.22 that F,eC™(IR"~.0), and D*F, is a locally 
integrable function also at O when |a|<2 unless v=0 and [«|=2; then we 
also have a term which is homogeneous of degree —n. It is obvious that F, 
is a function of |x|=(x7+ ...+.2)*. It would be easy to give an expression 
for F, in terms of Bessel functions, but that would just suggest lengthy 
proofs for the essential properties which are easily seen from (17.4.1). They 
are first of all 


(17.4.2) (-A-2)F,=vF,_,, v>0; (-4-2)Fy=6,; 
(17.4.3) ~26F/éx=xF,_,, v>0. 


(17.4.2) is obvious, and (17.4.3) follows from the fact that the Fourier 
transform of —20F,/0x,, is 


—2vlig, QE)? -—z)-"-! =(v—1) (DE? —2)~. 


We shall keep z fixed in the following discussion and have therefore not 
indicated in the notation that F, depends on z also. 


As a first step to an extension of (17.4.2), (17.4.3) we pull back the 
identities by a linear bijection T in R’. If y= Tx then 'Td/dy=0/dx so 


A, => 0/6x,g/*d/éx, if (g)=T-''T™!. 


Since F, is rotationally symmetric we may by some abuse of notation write 
F(x) =F, (|x|), and then we have F,(y)=F,(\x|,) if 


IxXl2 =) 8 Xj Xp 


where (g,,) is the inverse of (g’*). Recalling Example6.1.3 we obtain since 
(det T)~ 7 =det(g**) 
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(742 (-Yajea,-2)RUxlJ=vF_ s(x), v>05 
=(det g/)?6, if v=0; 
(17.4.3) —2)) s* 0, F,\x|,)=x;F,_ (xl, v>0. 

The preceding formulas are valid for any symmetric positive definite 
matrix (g/*) but the whole point is that they are applicable also if P has 
variable coefficients provided that the coordinates are well chosen. Recall 
that the principal symbol ) g’*(x)é,¢, of P is invariantly defined in the 
cotangent bundle. The dual quadratic form > & (x) dx,dx, in the tangent 
bundle defines a Riemannian metric. As proved for example in Appendix 
C.5 we can for every point in X introduce geodesic normal coordinates 
which vanish there and satisfy the condition 


(17.4.4) Y Spl), =F 8, (0)%,, fads... n. 
k k 


This means that the rays through 0 are geodesics with arc length equal to 
the distance in the Euclidean metric 


IL, =|lp,0=(X 8jx(0) x; x,)*. 


We have g,,(x)—g,,(0)=O(|x|’) and similarly for g’*. Usually one requires 
that g,,(0)=6,, which can of course be achieved by an additional linear 
coordinate change. However, it will be more convenient for us later on to 
require only (17.4.4). 

From (17.4.4) we obtain if fec! 


Ve) Of (xl )=L 9" O)Af(xl),  f=l,..sn. 


This shows that (17.4.2) remains valid when x+0 with g/*(0) replaced by 
g’*(x). The same is true at 0 in the distribution sense, for replacing g/*(0) by 
g/*(x) can only add locally integrable terms since the difference is O(|x|7). By 
the product rule and (17.4.2), (17.4.3) we obtain if u,eC® and v>0 


(P—z)(u, F,)=vu, F,_,+(Pu,) F,—(hu,—2 x, 6u,/0x>) F,_ 1/2, 
(17.4.5) h(x) =¥- g jx (0) B?(x) x, = YB jx (x) D(x) x. 
Similarly we obtain if Fy(x)=/f (|x|*) 
(P —z) (Ug Fy) = ug(0) (det g*)* 5 +(P up) Fy — 2(hug— 2x, Gu/dx))f". 
When we add for v=0,..., N, it follows that 


N 

(17.4.6) (P—z)>-u, F, =u, (0)(det g/*)* 6) +(P uy) Fy 
Q 

if u, are chosen so that 


(17.4.7) 9 2vu,—hu,+2<x, 0u,/Ox>+2Pu,_,=0, v=0,...,N, 


for this makes the coefficients of f’ and of Fo,...,Fy_, all equal to 0. (When 
v=0 one should interpret u,_, as 0 in (17.4.7).) 
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We shall now prove that the equations (17.4.7) have a unique smooth 
solution with u)(0)=1; no boundary condition is required for u, when v+0. 


Lemma 17.4.1. Let X be an open subset of IR" which is starshaped with 
respect to 0, that is, xeX >txeX if OSt<1. If he C°(X) and h(0)=0 then 
the equation 


(17.4.8) huy =2<xX, Ouy/Ox> 

has a unique solution u,éC™(X) with u.(0)=1. If feC(X) and v>0 then the 
equation 

(17.4.9) (2v—h)u,+2 <x, du,/dx> =f 


has a unique solution u, € C™(X). 


Proof. If we introduce polar coordinates x =rw with weS"~' and r>0, then 
(17.4.8) means for x+0 that éu,/dr=hu,/2r. If uy(0)=1 we obtain 


r 1 
Ug(x)=exp ({ h{s@) ds/2s) =exp (i h(t x) dt/22). 


Since h(0)=0 the quotient h(tx)/t is a C® function of (x,t)eX x [0,1], so 
UypEC”(X) and uy +0 in X. To solve (17.4.9) we set u,=u,v and obtain the 
equation 

vo+rovjor=g, g=f/2uy. 


Thus 0(r’ v)/ér=r’—' g, which gives 
r’ u(reo)= [3 1 elso)ds=r t’~1 g(tr@)dt. 
It follows that 
u, (x) =ug(x) t’~ 1 f(tx)/2ug(t x) dt 
is the only solution of (17.4.9); that u,eC™(X) is obvious. 


Remark. If b’, c and therefore h are square matrices, the preceding con- 
structions work with no essential modification. Only the formula for uy 
becomes less explicit. This makes the Hadamard construction applicable to 
systems with principal symbol )’ g/*(x)é,¢,1 where I is the identity matrix. 
This observation is quite useful, for the Laplacean on forms defined in a 
Riemannian manifold is of this type. 


Let us now just assume that we are given a second order operator P in 
an open set X CR” with C” coefficients and positive definite principal 
symbol p(x, )= > g!*(x)é,é,. Then it follows from Corollary C.5.2 that there 
is a neighborhood V of {0}x X in IR"xX, a neighborhood W of the 
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diagonal in X x X, and a uniquely defined diffeomorphism 


Va(X, yr (y(% y), yeW 


with y(0, y)=y, y;(0, y) equal to the identity, and the principal symbol in the 
X coordinates 


YER) EG =D ys V4(% V7?) 
satisfying (17.4.4). We choose V so that {X;(X,y)eV} is convex for every 
yeX. If (x, y)eW we have a well defined Riemannian distance s(x, y), 


(p(X, y), y) = oy & (0, y) x; %,)*. 


The square is in C”(W). Pulling the functions u,(X, y) defined by (17.4.7) 
back to W from V, we obtain uniquely defined U,eC™(W) such that 


(17.4.6) (P(x, D)-z)> U,(x, y) F,(s(x, y)) 


= (det g’*(y))? 6, (x) + (P(x, D) Uy(x, y)) Fy(s(%, y)). 


This works of course equally well on a manifold. Note that 6,(x) is a 
distribution density on X which becomes a distribution when divided by the 
natural Riemannian density (det g/*)~? dx. 

Since Fy(s(x, y))eC?%*+'—" the error term on the right-hand side of 
(17.4.6)' is as smooth as we please when N is large. All terms are C®™ off the 
diagonal. If we choose yeC™(X x X) with support in W so that y=1 in 
some neighborhood of the diagonal and set 


N 
F(x, y)=x(x, yd U,(x, y) F,(s(x, y)), 


it follows that 
(P(x, D) — z) F(x, y) =(det g’*(y))* 6, (x) + R(x, y) 


where ReC7%+1—". The operator 
F f (x)= F(x, y)f(y) (det g*(y)) “4 dy 


maps 6'(X) to Y’(X) and preserves wave front sets. With # defined similar- 
ly with F replaced by R, we have 


(P-z)F=I+2 


so ¥ is a right parametrix in the sense that Z is as smooth as we wish if N 
is large. Taking the adjoint of a right parametrix for the adjoint of P—z we 
obtain a left parametrix for P—z also. (A simple argument which we 
postpone until Section 18.1 shows that they have essentially the same singu- 
larities.) From these facts it is easy to derive for P the results on existence 
and regularity of solutions proved in Chapter XIII for general elliptic 
differential equations. (In Section18.1 we shall also give a simple general 
method for the construction of a parametrix for an arbitrary elliptic opera- 
tor.) 
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The preceding construction can also be applied to the wave equation 
67/ét? +P 


in IR"*! associated with P, and this will be essential in Section 17.5. If we 
replace z by 1? in (17.4.1) and take the inverse Fourier-Laplace transform 
defined by 


(17.4.1) E(t,xyavi(2ny-"—! f+ E222) dE dt 


Imt=c<0O 


in the sense of distribution theory, we obtain a distribution with support in 
the forward light cone {(t, x); t |x|} (cf. (7.4.7). We have 


(17.4.2) (62/at27—A)E,=vE,_,, v>0; (62/0t?—A)E=5p 0; 
(17.4.3)”" —20E,/Ox=xE,_,, v>0. 
In the following lemma we collect some further properties which will be 
required in this section and in Section 17.5. 
Lemma 17.4.2. E, is a homogeneous distribution of degree 2v+1—n with 
support in the forward light cone, 
(17.4.10) Ey = 27h -my/2yurd—m/2 92 _ 112) p>, 
E, is a C® function of t with values in 9'(IR") when t20, and 

Ok E(+0,.)=O whenkS2v, 0?%t' E,(+0,.)=v!dp. 
Furthermore, 
(17.4.11) WF(E,—E,)={(t,x51,9); ? =|x??, 

t=lE?, rx+¢6=0}, 


and E,~—E,, 6,(E,—E,) are continuous functions of x with values in 9'?*(R) if 
k is an integer with k = (n — 1)/2 —v. Ifk =(n — 1)/2 — v then for x =0 


OE, — Ey) = 2*-" 1 4~™/2(_1* 16 (2) (2k). 
0,(E) —E,) is the Fourier transform of 4sgnt deo(x,t?), where 


(17.4.12) @g(x,17)=(2n)-” fel ae. 
lel < Ie 


Proof. (17.4.10) was proved in Section 6.2 when v=0. Since 
(8? /6t? — A) x4, (t? —|x|?)=(4a + 2n—2) x7 (7 —|xI) 


by the computations preceding Theorem6.2.1, the recursion formula 
(17.4.2)” follows from (17.4.10). By Theorem 6.2.3 the equation (17.4.2)” has 
only one solution E, with support in the forward light cone when E,_, is 
known. Hence (17.4.10) follows by induction. The statement on the wave 
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front set follows at once from Theorem8.2.4 when t+0. When x=t=0 we 
just have to note that the wave front set is closed and that 


(6?/ét? — A)’* (E, —E,) =v !(59,9 — 50,0) =0; 


by Theorem 8.3.1 it follows that 1? =|<|? in the wave front set. 
In particular (17.4.11) implies that E,—E, is a C® function of x with 
values in 9’. Set a=v+(1—n)/2 and let 5=|x|?>0. Then 0,(E,—E,) is a 


constant times the distribution 


d 
C3 =X (t? —6)2|t|=t/|t| ate (t? — 6). 


If @ is an odd test function then <e,,,@>=0, and if @ is an even test 
function then 


C05, P= —Csarie¥r WH=¢o'(0)/2t. 


y is also an even test function, and by Taylor’s formula 
sup |W| <4 sup|o9*. 


Since e, and e,, are obviously continuous with values in 9’°, it follows 
that e,, is continuous with values in 9’ if k and 2a are integers and a 
+k20. We have eg _,=2(—1)*k!6@"/(2k)! 

E, is the limit when e-0 of Ey ,, 


Egttn=@n* J ef 8) +10) — 22)? (| E)2_ 72) 1 ge dr, 
Imt= —1 


Since 
fe®itM(lE|? 1?) dt=2ni?/(—2) (ell 4 eH, 


if the integral is taken from —i—oo to —i+oo and from i+ o to i— oo, it 
follows that O@(E),—Eo,)/ot is the Fourier transform of 
4sgnte—** de,(x,t”). When e—0 we obtain the last statement in Lem- 
ma 17.4.2. The proof is complete. 

With some abuse of the notation we shall write E,(t,|x|) instead of 
E (t,x) in what follows; when t=0 this should be interpreted as the limit 
when t- +0. If the coordinates are geodesic in the convex neighborhood 
Xo Of 0, it follows from (17.4.6) with u)(0)=1 that in X, 


(17.4.6)" (8? /at? + P(x, D) Su, E(t Ixl,) 
10) 


=(det g/*)* 59 9 +(P(x, D) uy(x)) Ex(t,|x1,)- 


The error term is in C* if k<N—(n—1)/2. If |x|,<c implies xeX, then 
E,(t,x)=0 for t<c in a neighborhood of {x;|x|,2c}, so (17.4.6)” remains 
valid in (— 00,c)x X for any X>X, if u, is extended arbitrarily to X. 


As in the elliptic case we can extend the construction to general coor- 
dinates by taking the pullback by the inverse of the map (t,x,y) 
++ (t, y(X%, y), y) to geodesic coordinates. Then we obtain: 
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Proposition 17.4.3. For any open YEXCR" one can choose c>0 and 
UjeC™(X x Y) such that, with s(x, y) denoting the geodesic distance from x to 
y, we have in(—0c,c)x Xx Y 


(1746) (62/002 + P(x, DS U,(x,9) E,(t,s(%,9)) 
19) 


= (det g’*)* 59, + (P(x, D) Uy(x, y)) En(t, sy). 


When s(x,y)Sc the coefficients U, are defined by integrating the equa- 
tions (17.4.7) in geodesic coordinates, and when s(x,y)>c their definition is 
irrelevant. 


If the coefficients of P are in C®%(X) and P remains elliptic in X, we can 
extend the coefficients to a neighborhood of X and then take Y=X. 
However, the situation is much more difficult when we want to construct a 
parametrix for the mixed problem for the wave operator 67/0t7+P in 
R x X with Dirichlet data on 0X. Then Proposition 17.4.3 can only be used if 
t<d(y), where d(y) is the geodesic distance from y to 0X. If P=—A and X 
is a half space, then the “Green’s function” with pole at (0, y) introduced in 
Theorem 12.9.12 is just Eg(t,x—y)—E,(t,x—y*) where y* is the reflection of 
y in @X. We shall prove that this construction can be modified in a manner 
completely analogous to (17.4.6)” when t is not larger than a fixed multiple 
of the distance d(y) of y from the boundary. It is possible to extend the 
construction to all t which are small compared to d?, which has to be 
assumed in order to guarantee that no light rays emanating from y arrive 
tangentially at the boundary. However, this is more laborious and will not 
be essential in the application in Section 17.5. 


Before proceeding it is useful to rephrase the existence of geodesic 
normal coordinates as the existence of an exponential map. Given 


P=—Y'6/0x,(g*6/0x,)+ ¥ b’d/8x;+ 


in X CR" we define again the Riemannian metric ¥ g,,(x)dx dx, where (g j,) 
is the inverse of (g/*), When yeX and s is a tangent vector at y with small 


norm, 
Isley aa Isl, =() 8 ju()SjS,)* 


then one defines exp,s as the point at distance |s|, from y on the geodesic 
with tangent vector s at y. If we introduce geodesic normal coordinates at y, 
then this becomes the identity map. Hence it follows that 


(y, s)->(y, exp, 5) 
is a diffeomorphism of a neighborhood of the zero section in the tangent 
bundle of X on a neighborhood of the diagonal in X x X. (The right setting 
for this is of course a Riemannian manifold.) 
Let us now consider a Riemannian manifold with boundary, or for 
simplicity an open bounded subset X of IR" with C® boundary 0X. We 
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assume that g/*eC®(X) and choose some extension of g/* to a neigh- 
borhood of X. When the geodesic distance d(y) from y to 6X is sufficiently 
small and |s|,<4d(y), say, we shall then define a reflected exponential map 
as follows. First we start from y on the geodesic with direction s. If the 
boundary @X is encountered at a distance <|s|, we shall prove that the 
intersection is transversal and that continuing on the geodesic with direction 
obtained by the usual law of reflection to a total arc length |s|, we obtain a 
point exp’,s with the standard properties of the exponential map. (In the flat 
case discussed above it will be the point (y+s)*.) In particular, (17.4.4) is 
valid for the metric in X when these coordinates are used. When |s|, =2d(y) 
and s has the direction of the geodesic from y minimizing the boundary 
distance, we obtain exp) s=y. 

To simplify the discussion of the reflected exponential map we assume 
that X is defined by x,20 in a neighborhood of OeIR” and that the 
coordinates are geodesic with respect to the boundary plane 0X defined by 
x, =0 (Corollary C.5.3). Thus 


Zin=O for f#n, 8y,=1. 


Let K, be a compact subset of 6X. Translating and changing scales we 
obtain the metric G” defined by 


Y gly’, 0)+ y,x)S,S,=|S2?. 


If (y',0)eK, this is defined in any desired bounded subset of IR” when y, is 
small enough, and when y,=0 we obtain a flat metric with no cross 
products between dx, and dx, for jn. The point y corresponds to the fixed 
point 17=(0,...,0,1). The exponential map 


B= {SeER"; |S, <4} 3Sreexp;,S 


corresponding to G” is a C® function of (S,y). For y,=0 it reduces to Sho 
S+1, so for fixed small y, it is a diffeomorphism in a neighborhood of 
B’.). The n™ coordinate F(S,y) of exp?S is equal to 1+5S, when y,=0. 
Hence the zero set of this function is a C® hypersurface close to the plane 
S,=—1 which divides B” into the starshaped inverse image B’. of X and 
the complement B”_ = B’~ B’. mapped to [X. When SeB”_ the equation 
F(AS, y)=0 has a unique solution A(S, y)e[0,1]; it is a C® function equal to 
—1/S, when y,=0. The point of reflection 


x(S, y)=exp7(A(S, y)S) 


is therefore also a C™ function of S and y, and so is the tangent T(S,y) of 
the geodesic there. When y,=0 it is equal to S. Let T(S, y) be the reflection 
of T(S, y) in the tangent plane of 0X at x(S,y), which just means a change of 
sign for the n" coordinate. Then our reflected exponential map is defined by 


expr S=expys ((1—A(S, y) T(S, y)), SEB’. 
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For small y, this is a C® function of (S, y) and it reduces to 
SH(S,,...,8,-1 -1-S,); S,< 1, |S<4; 


when y,=0. Note that the reflected exponential map gives another coor- 
dinate system in a neighborhood of I. (See Fig. 1.) 

The differential with respect to $ of the total length {S)j, of the broken 
geodesic from IT to x(S,y) and on to Y(S,y)=exp}’S is equal to the 
differential of the length of the geodesic from x(S,y) to Y(S, y) when x(S, y) 
is left fixed, for the reflection law means that it is stationary with respect to 
variations of x(S, y). Thus the surface {expj;’S;|S|},=R} is orthogonal to the 
geodesic from x(S,y) to Y(S,y). For the metric G’” in B” obtained by 
pulling the metric G’ back from X by the reflected exponential map exp)’, 
the rays through 0 are therefore geodesics with arc length |S|}, and they are 
orthogonal to the spheres |S|},=constant. But this means precisely that G”” 
satisfies (17.4.4). We can therefore use the Hadamard construction with 
these coordinates in Q”=exp}” B’_ to construct the reflection of the para- 
metrix of P at y. 

To do so we first rewrite (17.4.6) in terms of the stretched coordinates. 
Set 


B(x, D)= —¥' 4/0x,(g!*((y', 0) + ¥,x)0/Ox,) 
+Yq ¥, Di(y', 0) + yx) 0/8x + ya c((y’,0) + y,x))- 


Then (17.4.6)’" means that 
N 
(67/dt? + B(x, D)) ) Uy’, 0) + yx, Y E(t, 9°(x, I) yn” 
0 


=(det g*(y))* 59,7 + (B(x, D) Uy((y’, 0) + y,x, y)) Ey(t, 9°(x, I) y2% 


Here s*(x, 1) is the G’ distance from IT to x, and we have used that the 
Dirac measure in R"*! is homogeneous of degree —n—1 and that E, is 
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homogeneous of degree 2v+1—n. Let Q, be the operator P pulled back to 
B” by the reflected exponential map. The coefficients of Q, are C® func- 
tions of (x, y), and when y,=0 we have 


Q,=) 8/*((y',0)) 67 /dx, 0x;,. 
Now we determine v,¢C™(B”_) so that 
(17.4.7Y 2vv,—H,v,+2<S, 0v,/0S> +2Q,0 


where H, is defined by (17.4.5), for the operator Q, of course. To satisfy the 
Dirichlet boundary conditions we must impose the boundary conditions 


(17.4.13) v,(S)= yn" U,((y',0)+y,exp7S,y), SeBs, 


=0 


v-1 


where BY = B’ 0B’. It is clear that this determines all v, as C® functions 
of (S,y). When y,=0 we obtain v,=1 and v,=0 for v+0. Define V,(., y) in 
22” so that the pullback by expj;” is v,, and set 


(17.4.14) E(t, x, y) = (U,((, 0) + YX, Vn" Ey(t, 8°(x, 1) 
7s V(x, y) E,(t, s(x, IT))), xe, 


where s””(x, 17) is the reflected G” distance from IT to x. Let Q be the set of 
all (x, y) with xeQ” and y in a small neighborhood w of Kc0dX. Then we 
have &%(t, x, y)=0 in the sense of distribution theory when x, =0, and 


(17.4.15) (0? /at? + P(x, D)) &%(t, x, y) 
= (det g/*(y))* 59, + (B(x, D) Un((y’,0) + y,.x, y)) ya" E(t, $°(x, 1D) 
— (B(x, D) V(x, y)) Ey(t, s(x, ID); (x, y)eQ. 


Note that the error term becomes arbitrarily smooth when N is large. 
Summing up, we have proved 


Proposition 17.4.4. Let Ky be a compact subset of 0X. Then Ky has a 
neighborhood w in X such that (17.4.14) for every N defines a parametrix 6% 
for the Dirichlet-Cauchy mixed problem in Q={(x,y); yew, s”(x, I)<4} 
when t <4; this means that &™(t,x, y)=0 when x,=0, in the distribution sense, 
and that 


(17.4.15)  (8?/dt? + B(x, D)) &(t, x, y)= (det g*(y))* 59, n+ Rylt, x, y) 


where Rye C* when u<N—(n—1)/2. The coefficients V,(x, y) are C® functions 
of (x, y)EQ, and V,(x, y)=1, V,(x, y)=0 for v+0 when y,=0. 


Since the coefficients of the incident wave in (17.4.14) vanish as y?” when 
y, =0, and differentiation with respect to x increases the order of the zero, 
one might ask if the last statement can be improved. However, this is not 
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possible. To see this we shall carry out the proof of Proposition 17.4.4 to 
first order in y,=6 when y’ =0. We may assume that the metric is Euclidean 
at 0 and geodesic inside the boundary, hence 

G’ =Y dx} —26x,H(dx’) 


where = means equality mod O(6’). Here the quadratic form H is the 
second fundamental form of @X at O with normal oriented towards X. 
Since only the second fundamental form at 0 is important, we can as well 
do the calculation in IR" with the metric }'dxj and X defined by 
x, > OH(x’)/2, for this gives rise to a metric G’ of the preceding form when 
one takes geodesic coordinates with respect to 0X. We may assume that 
H(x')=>H a where H, are the principal curvatures, and we shall write 
A(x')=<x', Hx’) with H diagonal. 
The first step is to find 2 so that (AS’,1+AS,)e0X, that is, 
14+A4S,=dH(AS’)/2. 
Thus AS,= —1+0(6) and AS,= —1+6H(S’/S,)/2. The reflection takes place 
at x=(AS’,6H(AS')/2). The normal is there (-—6Hx’,1)=(6HS'/S,,1), with 
length =1. The reflection S of S in the tangent plane is 
S=S—2(5H(S'VS,+S,)(6HS'/S,, 1) =(S' —25HS’, —S,—-25H(S'/S,). 
Thus 
expt’ S=(S'—2(1+1/S,)dHS', —1—S, —5H(S')(1/S? +2/S,)). 
The pullback Q of A by this map is equal to the pullback by the map St» 
S+6y(S) where 


w(S)=(—2(1+1/S,) HS’, (1/Sz + 2/S,) H(S')), 
hence 
Q=A,—6 y 0/08 jW j,0/0S,+6 3 6? w,/0S , 0S, 0/0S, 


where w j,(S)=0w,/0S ;+ Ow /OS,. The function h in (17.4.5) becomes 
6>'S,07 wy /0S , OS, =26(Tr H + H(S'/S,))/S,,. 
The first transport equation is in polar coordinates 
ar dv,/er=hvo 


with boundary condition vy=1 at AS. Thus v,>=1+6w, where 2rdw,/or 
=2(TrH+ H(S’/S,)/S,, Wo =90 when S,=—1. This gives 
wo = —(TrH+A(S/S,))4+1/S,), 
Qv)=6 Aw, =6(—2Tr H(S, +2)/S3 — 6 H(S')(S, +2)/S?). 
The next transport equation (2rd/ér+2)v,—hv,=—2Qv, with boundary 


condition v, =O when S,=—1 has a solution O(6) so we can drop the term 
hv, and integrate explicitly to get 


pv, = —26(TrH + 3H(S'/S,))(1 +S,)/S3. 
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Inductively we obtain for v>0 
(17.4.16) v,=—62°2v—1!N(Tr H+(2v4 1) A(S/S,))(1+S,)/S2"*?. 


In particular, taking S’=0 and S,= —2 we obtain 


Proposition 17.4.5. For the reflection coefficients V, in (17.4.14) we have when 
Yn=0 
OV(IT, y/Oy,= —2-*" *(2v—- I F(y) 


where F(y') is the mean curvature of 0X at (y’,0). Here (—1)!! should be 
interpreted as 1. 
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As in Section 17.4 we denote by P(x, D) a second order differential operator 
of the form 


P(x, D)= —) 6/0x,(g/*6/0x,)+ > b/0/Oxj+c 
with coefficients in C~(X) where X is a bounded open subset of IR" with 


C® boundary. We assume that (g/*) is real and positive definite in X and 
that P is symmetric with respect to a density Ydx with 0< YeC™(X), 


(17.5.1) (P(x, D)u, v)=(u, P(x, D)v)y; uu, vECH(X), 


where 
(u,v) = fudYdx. 


(The density simplifies coordinate changes but should be disregarded as 
otherwise irrelevant.) Let P be the operator defined by P(x, D) in L7.(X) with 
Dirichlet boundary conditions. As explained in Section17.3 we have ueDy 
if and only if P(x, D)u=feL? and u is in the closure H of Co(X) in Hy4)(X). 
Then we have , 


> (g* Du, Dv) +¥ (b/4ju,v)+(cu,v)=(f,v); ve H. 


If veDy we conclude using (17.5.1) that (Pu, v)=(u,Pv),. With v= Yu we 
obtain 


Y |Djull? + lull? SC Pu,wy+ Cz llull?;  ueDy; 
where || || denotes the 2? norm in X. Hence F is closed. As in Section 17.3 
it follows that P+ C,/C, is bijective from Dz to L?, which proves that F is 


self-adjoint. If P(x, D) is replaced by P(x, D)+C,/C, we do not affect any of 
the statements on the eigenfunctions proved below, and then we have 


(17.5.2) lull) SCi(Pu,y); uEDg; 
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where we have used the notation 


llully= ay | D*ull?. 


From now on we assume that (17.5.2) is valid. We shall also write Hay(X ) 
for the set of all u with D*ueL?(X), |a|<k, and with the norm just defined. 
By Cauchy-Schwarz’ inequality and (17.5.2) we have 


(17.5.2) lly SCiIPul, ueD. 


In Section 17.3 we proved the stronger estimate (17.3.6), and we shall 
elaborate it further as follows. 


Lemma 17.5.1. If ueDy and PueH (X) where k is a non-negative integer, 
then ue Hy .2(X) and 


(17.5.3) MllasaSClPullay, uEeDys. 


Proof. For k=0 this follows from Theorem17.3.2. When k>O we may 
assume that the lemma has already been proved with k replaced by k- 1. If 
peCP(IR") we then obtain 


|Au)| a Ssup [| |Pullay+Clulas ySClPulla- 


Hence it suffices to prove (17.5.3) when u has support in a coordinate patch. 
We can then assume the coordinates chosen so that X is there equal to the 
half space x,>0. If j<n then DjweH and PDju=D,;Pu+iPyu where FR, is 
also a second order operator. Hence (17.5.3) with k replaced by k—1 gives 


|D; Masons Cy. i(|D; Pull ag yt lellas dS CWP Ulla: 
This gives the desired estimate for all Du, |a|=k+2, except Dk+?u. Now 
the differential equation gives 
D2u =(g™)"(— SS Dy(g"*Dyu) +...+ Pu) 
jkényn 


where dots denote lower order terms, so the estimate of D‘*?u follows at 
once. The proof is complete. 


A useful consequence of (17.5.3) is that, with new constants, 


(17.5.4) Ullon SC IP*ull, uEeDon. 
This is clear if k=1. If k>1 and (17.5.4) is proved for smaller values of k, 
then 
IFull or C, |Prull, uED ox, 
and (17.5.4) follows from (17.5.3). 


Already the estimate (17.5.2) implies that Y has a discrete spectrum, and 
P is positive by (17.5.2). In fact, if E, is the spectral family and u=E,u, then 
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||Pul|SAljul| so jullySC,Aliul) by (17.5.2). The set of all such u with 
lul| <1 is compact in I? by Theorem 10.1.10. Thus E,1?(X) is a finite 
dimensional space (see e.g. Lemma 19.1.4). Let $,,0,,... be an orthonormal 
basis in L4. for the eigenfunctions such that 0<A,</1,<A,<... for the 
corresponding eigenvalues. Then the kernel of E, is Y(y) times 


(17.5.5) ee rA= Y 0/0)0,0). 


and $,¢ C®(X) since EH y{X) for every k. Here we have used a supple- 
ment to the Sobolev lemma (Lemma 7.6.3) which will also give an estimate 
for the spectral function e: 


Lemma 17.5.2. If v+n/2<k then ueHy(X) implies ue C’(X) and 
(17.5.6) AK-Y— 3" S$” sup|D*ul? < C(lullg,+4*ull?), 221. 


lal sv 


Proof. Decomposition of u by a partition of unity shows that it is sufficient 
to prove this when u has support in a coordinate patch, so we may assume 
that X=IR", and that u has compact support. By Theorem B.2.1 it suffices 
to prove (17.5.6) when ue C>(IR",). Then we shall even prove that 


(17.5.6Y ARTI 4" sup |D*ul?<C( YP |DFull?+A¥iiuil?), — JalSv. 
isl=k 


Taking 47x as a new variable we see that it suffices to prove the estimate 
when 1=1. Then we first prove that 


k 
(17.5.7) » | Djul|? S C(|Dfull? + lull’). 
0 
This is clear when k=1. To prove (17.5.7) for k=2 we observe that 
f (oP —lo' P +0?) de= f |v’ +0’ +0)? dt+|v0)+0'O)?, ve CZCR,). 
0 0 


It suffices to prove this when v is real valued, and then we have 
yp’? —y'? + p2 —(v" ae vp’ as v)? =(v" +0')(v" —p' —p" —p' wee 2v)= ae 2(v" ts v')(v + v’). 


(a deeper reason for this identity is that €*—é€?+1=|é*-if—1|*, EER, 
where €*—ié—1 has its zeros in the upper half plane.) Hence we obtain 
(17.5.7) with k=2 and C=2. Changing scales we have with I? norms on 
(0, oc) : 
jv’? Selo" ||? +e7* Holl?, — e>0, 


which proves that |v'|\*<2\\v”|j |v. If now (17.5.7) is proved for a certain 
value of k, we obtain when applying it to D,u 


k+1 
Y WDjuil? < C(DF* ‘ull? + Dull?) < C(|Dy** ull? +2 [lull | Deull). 
1 
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Since 2C |jul |D2ul| <||D2ul/?/2+2C7\\ulj?, we obtain (17.5.7) with k re- 
placed by k+1 and another C after cancellation of ||D?uj|?/2. 
Using Parseval’s formula for fixed x, we can strengthen (17.5.7) to 


(17.5.7) Y ||Deuil7< p> || D*te||? + |[u|\). 


lal Sk 


This is obvious if «,=0 in the two sums and follows without such a 
restriction if we then use (17.5.7). The proof of (17.5.6) is now reduced to 
proving the estimate 


sup |ul?<Cllullj, we Hyy(R*,), 


Xn>O0 


if k>n/2. By Corollary B.2.6 it follows from the same result in H,,(IR") 
which is precisely Lemma 7.6.3. The proof is complete. 


We can now prove a crude but useful estimate for the spectral function. 


Theorem 17.5.3. There are constants C, such that 


(17.5.8) |D%,e(x, y,ASCACtP?; x, yex. 


Proof. Let fel? and apply (17.5.6) to u=E, f with 2k>|«|+n/2. Since 
ltl op SClP*ull SCA ISI 
by (17.5.4), it follows from (17.5.6) that 
[D*E, f(x)? SCA"? If 7. 
Here 
DE, f()=(Ler=(Esfgy, 8(y)=Dyetx, y, A). 
Thus E,g=g and |gi?<C,Al!+"/?, Now ||P¥gi| </* ||| so (17.5.6) gives 
DF g(y)IP SCypAlal tll 


which completes the proof. 


An immediate consequence of (17.5.8) is that if N(A) is the number of 
eigenvalues </, counted with multiplicities, then 


N(A) = TrEy = f e(x,x, T(x) de = OCW"). 


where Tr denotes the trace. (We just have to integrate the definition 
(17.5.5).) The goal of this section is to prove rather precise asymptotic 
properties of N(A) by estimating e(x, x, /) first in compact subsets of X and 
then at the boundary. 


Remark. If the coefficients of P(x,D) are just in C*(X) and FY is any self 
adjoint extension of P(x, D) with domain Cf which is bounded from below, 
then a slight modification of the proof of Theorem 17.5.3 shows that E, has 
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a C® kernel satisfying (17.5.8) on compact subsets of X x X. If 0X has a 
smooth part @, the coefficients of P(x,D) remain smooth in X Ua, and the 
domain of # contains the smooth functions with support in X Um vanishing 
on w, then (17.5.8) remains valid on compact subsets of (X Um) x(X Ua). In 
these circumstances Y may not have a discrete spectrum so (17.5.5) is no 
longer meaningful. However, the results proved below on the spectral func- 
tion remain valid. 


Although the aim is to study the kernel of E, we shall first examine the 
kernel of another function of A which is easier to approach. A particularly 
convenient choice is the cosine transform of the spectral measure 


cos (t VA)=5 cos (t VaA)dE,. 
0 


To identify the kernel of this operator we take weY(IR) and feCj(x) 
which gives 


J COseVALDrwde= f Wide feos(tV Ades, £2) 
where 
ef FA=(ES fy =ff ex y, 4) S) £0) Yo) Vy) dx dy 


is an increasing function of 4 bounded by || f ||. Interchanging the orders of 
integration we obtain 


J (CostVA LN, wiedt 


3) (b(VA) + B(—VA))de(F, f,d) = HW) + #(—1)) de(f, f,7”) 


§§ £0) £0) YX) YQ) dxdy JH +v —1))d,e(x, y,1?)/2. 


The last interchange of orders of integration is obviously justified if eC. 


Hence the distribution kernel K(t, x, y)e@'(IR x X x X) of frecos(t VP) f is 
the Fourier transform with respect to t of the temperate measure dm(x, y,t), 


(17.5.9) m(x, y,t)= Y(y) (sgn t) e(x, y, t7)/2. 
That this is a temperate measure follows by polarization from the fact that 
|a|? e(x, x, A) +abe(x, y, 4) +abe(y, x, A) +|b|? ely, y, A) 


is increasing and O(4"*) (by Lemma 17.5.3) for arbitrary complex a and b. If 


we can get good information on cos(tY) then inversion of the Fourier 
transformation should give the desired results concerning e(x, y, A). 
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If feC> then u(t,x)=cos (VP) f(x) is equal to f when t=0, we have 
du/ot =0 when t=0, and 
|(G/dty Pull S| ***f | 


for arbitrary positive integers j and k. In view of (17.5.4) and Lemma 17.5.2 
it follows that ueC*(IR x X) and that u=0 on Rx 6X. Furthermore, we 
have 

(P+ 67/at?)u=0. 


Using the parametrices constructed in Section 17.4 and the initial conditions 
u=f and du/dt=0 when t=0 we can therefore reconstruct u approximately. 
Before writing down the approximation we prove a lemma which will allow 
us to estimate the error. 


Lemma 17.5.4. Let ve C®([0, T] x X) be a solution of the mixed problem 
(P+82/d)u=h in (0,T] x X; | 
(17.5.10) v=Q in [0,T] x Ox; 
v=O0v/Or=0 if t=0. 
Assume that 6/h/éti=0 when t=0 if j<k. Then it follows that 
k+1 t k-1 
(17.5.11) oy Dit Fob MySCe (i |DEh(s,.)|dst+ ¥ IDE he lg). 


0 0 


Proof. If k=0 the assertion is a standard energy estimate proved as follows. 
Since (P 0v/0t, v) =(6v/dt, Pv),, we have 


2 Re(h, dv/dt), =0/dt(||Ov/dt\|2 + (Pe, v),). 


Integration from 0 to t gives in view of (17.5.2) 
0o/ 00) Wwlay/Cy 2M | While ds, 
M? = Sup lov(s,./asilr+ llo(s, Iltay/Ca- 
The same estimate holds for smaller values of t, so 
M?2M { |A(s, ily ds. 


which proves (17.5.11) when k=0. If k>0 we have 07v/6t? =h—Pv=0 when 
t=0. If (17.5.11) is proved for smaller values of k we can therefore apply 
(17.5.11) to év/ét and obtain (17.5.11) with the term for j=k+1 missing in 
the left-hand side. Now it follows from (17.5.3) that 


0G Mary SCIP OC Dla 1S COAG Ma 1y t+ ID? 06 a1) 


which completes the proof. 
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Choose c>0 so that (17.4.6)" holds for all (x,y)jeX xX at geodesic 
distance s(x, y)<c. Let d(y) denote the geodesic distance from y to 0X and 
ne ae {yeX ;d(y)>p} for some p<c. If yeX, and t<p the parametrix in 


N 
E(t,x, Y=) U(x, y) E(t, s(x, y) 
0 


is well defined when xeX and vanishes near 0X. If feCq(X,) then 


u(t, x)=] E(t, x, y(det g*(y))F f(y) dy 


is in C%({0,0] xX); u=0 in [0,p]x@X and u=0, du/@t=f when t=0. 
Moreover, u depends continuously on f in the C” topology. This follows at 
once from Lemma 17.4.2 if we introduce instead of y a new variable z with 
exp,z=x, for then we obtain a sum of terms with E,(t,z) acting on a C® 
function of z and x, as a function of z. Thus 


v=cos(tVP)f —du/at 

has zero Cauchy and Dirichlet data, and by (17.4.6)” 

h(t, x) =(P(x, D) + 67/6t7) v(t, x) 

= —J ORy(t, x, y)/dt(det g*(y))* fF) dy, 
Ry (t,x, y) = (P@, D)Un (x, y))En(t, s(x, y)). 
By Lemma 17.4.2 we have RyeC**! if N>k+(n+1)/2, and for |a|<k we 
then obtain 
[Df .. OR y(t, x, y/dt| SC r7N—"— lal, 

Since s(x, y)<t in the support it follows that 


k-1 


t 
J IDEA(s,.)ds+ Do DE-* FA, My Cert tt fins. 
io) 0 

Hence Lemma 17.5.4 gives 


OG Maas) FID, 06 Mag Cert ~ PK fll, £20. 


We may replace k by 0 here. If |x|+5n<k+1, hence if |zj|+n+1<N, we 
obtain using Lemma 17.5.2 with A=t~? that D% ,v is continuous and that 


(17.5.12) IDe,o/scr™—lel-" Vln, 


provided that at most one derivative is taken with respect to ¢t. This 

restriction is removed at once by means of the equation @? v/ét? =h—Pv. 
Set my 

Ky(t, x, y)=dm(x, y, t)~ &,(t, x, y(det g*(y)) 4, t>0, 


which is a continuous function of t with values in 9’(X x X,). If we replace 
f by D* f in the proof of (17.5.12) we obtain 


[De <Ky(t, x,y) D?f (yl SCN Fl" fy, 
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if jat+Bl+n+1<N. Choose yeCPdR") with fxdx=1 and set y,(x) 
=y(x/e)/e". Taking f(y)=7,(z— y) we then obtain 


Di ..<Ky(t.x,-), 42-13 CPN", fal +nt+1<N. 
Letting e—0 we conclude that KyeC’~"~? and that 
\Ds. Ky(t,x, IS CN Walon, |a| SN —n—3. 


Xs yst 


Since dm is even we obtain the same result when t € IR with K n(t,x, y) replaced 


b ~ 
y dma(x, y,t)—8,(8(—t,x,y)—8 (—1,x, det g*(y)) 4. 


Here all terms are continuous functions of (x,y) with values in Z’(IR) so we 
can put x=y and obtain 


Theorem 17.5.5. Let 2 = {(t,x) € R x X;|t| < min(d(x),c)}. Then 
dmix,x,t)— Y, 8,(E,(t,0)—E,(t,0)) U,(x, x) (det g/*(x))-* 


2v<Hn 
is in C®(Q) if n is even, and in C™(Q) after division by \t| if n is odd. All 
derivatives are bounded in Q. For t20 the Taylor expansion with respect to t 
is in both cases 


>, 6,(E,(t, 0) —E,(, 0) U,(x, x)(det g*(x))~?. 
2v2n 
Here U,(x,x) are the coefficients in Proposition 17.4.3 restricted to the diago- 
nal, so they are polynomials in. the coefficients of P, their derivatives, and 
(det g/*(x))~?. 


It follows from Lemma 17.4.2 with the notation (3.2.10) that 
6,(E, (t, 0) — E(t, 0)) = 272° n't ="? ¢2°-"/P(v +. (1 —n)/2) 


if n is even; when n is odd we just have to replace t by |t| unless 2v=n—1 
—2k where k is an integer 20; then we have 


a,(E,(t,0) —E, (t, 0))=272" 21 -/2(— 1k k 1 2 /(2K) I. 


This is clear when t+0 and follows at 0 since the left-hand side is homo- 
geneous of degree 2v—n and even in t, so a distribution with support at 0 
can only occur in the last case which was discussed explicitly in Lem- 
ma 17.4.2. 


If the Fourier transform of a measure du is known in an interval J, then 
we can compute the convolution of du with any function @ such that supp 
cI. The following lemma will give some estimates for du which can then be 
obtained. We denote by ¢ a positive function in “(IR) with [ @(t)dt=1 and 
supp ¢c(—1,1), such as 6=|W|?*|W|? where We C%(—4,4) and the ? norm 
of w is 1. Set ¢,(t)=(t/a)/a, a>0, so that ¢,(t)=@(at) has support in 
(—I/a, 1/a). 
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Lemma 17.5.6. If 4 is an increasing temperate function with p(0)=0, and v is 

a function of locally bounded variation such that v(0)=0 and 

(17.5.13) |dv(t)| S$ Mo(|t|+9)"~' dt, 
(du—dv)*b,()|SM,(\t1|+a,)", TER, 

for some xe[0,n—1] and ay 2a, a, 2a, then 

(175.14) |u(z)—v(2)|S C(Mg a(t] +45)" + M, [2] +.4)({t] +.4,)") 


where C only depends on k and n. 


Proof. Choose cy >0 so that >cpy in (—4,4). Then 


tt+ja 


Coa" I dusdux$,(t)SM,(\t]+a,)°+C, Mo(\t|+a,)""*, 


since 
§¢,(s)(It —s| +a,)"~' ds=J 6(s)((t—as|+a)"~' ds 
S(|t|+a5)"~* | O(s)(1 +|s))"~? ds. 
From this estimate it follows that 
Colu(t)— u(t —a)| Sa(|s| +1)(C, Mo(\t| +a) +aIs\"—! 
+M,(|tl+a,+a|s])*) 


if we divide (0,s) into <{s|+1 intervals of length <1. Multiplication by ¢(s) 
and integration yields 


[w(t)— ph, (| SC, a(Mo(t| +4)" * + M,(It| +.4,)*). 


The first part of (17.5.13) gives such an estimate with yw replaced by v and 
M, replaced by 0. Since integration of the second part of (17.5.13) from 0 to 
T proves that 


(u—v)*,(t)—(u—v) #O, (OSM, |t1(1t] +a, 
the estimate (17.5.14) follows. 


In the following theorem e,(0,A) is defined by (17.4.12), hence equal to 
(22)~" 4"? times the volume of the unit ball. 


Theorem 17.5.7. For the spectral function e(x, y,4) of the Dirichlet problem in 
X we have, if d(x) is the distance from x to 0X, 


(17.5.15) je(x,x, A) Y(x)—e,(0, 4)(det g/*(x))~ 4] < CA"? /(1 +. d(x) A?). 


Proof. We shall apply Lemma 17.5.6 with a=1/min(d(x),c) and 
u(t) =m(x, x, t)= Y(x)sgn t e(x, x, t7)/2 
v(t) =sgn t e,(0,t7)(det g/*(x))- 3/2. 
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By Lemma17.4.2 the leading term in the expansion of dn given by Theo- 
rem17.5.5 is the Fourier transform of dv; the other singular terms are 
Fourier transforms of |t|\""'~?' times smooth functions of x for 0<1< 
(n—1)/2. Hence du—dv)*@, is the sum of the regularizations of these func- 
tions and a bounded function, so (17.5.13) is valid with «=max(n— 3,0). 
Hence (17.5.14) is valid which proves (17.5.15) when d(x)A?>1. Since 
(17.5.15) is a consequence of (17.5.8) when d(x) A#<1, the proof is complete. 


Corollary 17.5.8. If N(A) is the number of eigenvalues SA of the Dirichlet 
problem in X, then 


(17.5.16) |N(A)—(22)-" C, vol(X) a"??| < CAM Y!2 Jog A 


where vol(X) is the Riemannian volume of X and C,, is the Euclidean volume 
of the unit ball. 


Proof. Since 
N(A)= J e(x, x, A) Y(x) dx 
x 


we obtain (17.5.16) by integration of (17.5.15). 


(17.5.16) is a vast improvement compared to our earlier bound N(A) 
= 0O(A"?). However, we shall show now that the factor log/ can also be 
eliminated. The interest of this improvement is that an example below 
shows that the error term O(/~1?) is sometimes optimal. We shall also 
show that the boundary @X has an influence proportional to 1-1? and to 
its Riemannian volume. 

The main point is an extension of Theorem17.5.5 which gives better 
information at 0X. To state it we first introduce the map 


aX x [0,c] a(x’, d)r> x(x’, de X 


such that 6+ x(x’, 6) is the geodesic with arc length 6 normal to dX at x’ 
when 6=0. For small c this is a diffeomorphism on a neighborhood X, of 
aX in X. We shall denote the inverse by xt->(y(x), d(x)). Thus d(x) denotes 
the geodesic distance from x to 0X as before, and y(x) is the point on 0X 
with geodesic distance d(x) to x. 


Theorem 17.5.9. If c is sufficiently small and xeX,, 0<t<c, then 


(17.5.17) 1" dnkx, x, t)=1, (x,t) —1,(y(%), d(x)/t,) 


where “ 
(i) I,eEC™(X x [0,c)) and the Taylor expansion at t=0 is given by 


Yi 277 eh M2 42° F(y +(1 —n)/2))~ * U,(x, x) (det g/*(x))7# 
with U, defined in Proposition 17.4.3; 
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(ii) 201 in the support of I,(x’,0,t) and 20=1 in the singular support, 
pol 
1,(x',0,t)— >) VU, x(x’, 0t)) (det 2 (x(x’, Ot)" * 0 * EV, 28) 
0 


eC"(@X x [4,1] x[0,c]), k<p—(n—-1)/2, 
with V, as in Proposition 17.4.4 and E‘(t, x)=0E,(t, x)/0t. Moreover, I,(x', 0,0) 
= Eo (1,2 6)(det g*(x’,0))-*. 


Before discussing the proof, which we cannot complete until Chapter 
XXIV, we shall apply Lemmal17.5.6 to draw conclusions concerning the 
spectral function. 


Theorem 17.5.10. For the spectral function e(x, y, A) of the Dirichlet problem in 

X we have 

(17.5.18)  je(x,x, YY (x) — (eo(O, A) — eo(2d(x), A) (det g* (x) 2 | 
SCMOl+dGy Yk: 

Here and below we sometimes write e(6, X) instead of eo(y, X) when |y| = 6. 

Proof. When d(x)>c>0 this is a consequence of Theorem 17.5.7, for 


(17.5.19) |eg(2d(x), AS CA"? (1 + d(x) A8)— Ot? 


by Corollary 7.7.15, and the right-hand side is O(A"~1”?/d(x)). When d(x)<c 
we shall use Lemma 17.5.6 with a=1/c fixed and 


y(t) =m(x, x, 2) =4 V(x) sgn te(x, x, 22), 
v(t) =5 sgn t(€9(0, 17) — e9(2.d(x), 1) Vo(IT, x)) (det gi*(x))~?. 
From (17.4.12) it follows that 


|deg(x, t7?)/dt}=(2n)-"| ff dS| 
[l= |e] 
where dS is the Euclidean surface measure. Hence 
|deg(x,t7)| S$ C|t|""' dt 
where (27)"C is the area of the unit sphere. This proves the first part of 
(17.5.13). The theorem will follow if we prove that 
(17.5.20) du dv) b,(1S Cel + a(x. 
In fact, Vo (IT, x)—1=O(d(x)), and d(x) e,(2d(x), t7)=O(t"~') by (17.5.19). 
The proof of Theorem 17.5.7 gives the bound C(1+|t|)"~? for the inverse 
Fourier transform of 


Hlt/e)(dm(x, x, t)—t-" I4(y(x), d(x)/t, 0) 
~(E4(t,0) + E5(—t,0))(det gi)~3), 


What remains is to estimate the inverse Fourier transform of 


Plt/c) t-"(I2(y(%), d ()/t, t) — Vo (IT, x) Eo (1, 2d(x)/t)(det g/*)~*). 
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When 0<6<4 and x, =d(x) then 
t" dm(x, x, t)—(E(1, 0) — Vo(IT, x) Ep (1, 2.6)) (det gi*(x))-# 


has uniformly bounded derivatives with respect to (x’,6,t) for small t and 
vanishes when 0=0 or t=0, so this difference is O(@t)=O(d(x)). Hence we 
have, with d=d{(x) 


[12 (x', 8,1) — Vo (IT, x) Ey(1,20)(det g*(x))" "1S? +d, 0S0S4. 


When 4<6<1 it follows from (ii) in Theorem 17.5.9 that such an estimate 
holds after subtraction of sufficiently many singular terms. For a function 
<C|A(t/c)| (tt? +a)/(t +d)" +1) the inverse Fourier transform with respect to 
t can be estimated by 


Cle+d {(t+d)-"dt+ | (t+d)-"di< C(c+d?-"(1(n—1) +0). 


Choose ~ € C£°((—4,4)) even, with 7 = 1 in (1,3) and p = 0 in (0,4). What 
remains is to estimate the inverse Fourier transform of the singular terms in 
I, with v+0 cut off by a factor w(t/d). First we prove that the inverse 
Fourier transform with respect to t of w(t/d) E’,(t/d,2)/d" can be estimated by 
Cd'~"(1+|t|d)"~}”*-”. Introducing t/d as a new variable we reduce the 
proof to the case where d=1. Then 


WD EW, 2)=C, We t(e-+ 24D? r+ D2 2), 


The inverse Fourier transform of the last factor is apart from a constant 
factor e?'*(r+i0)"~ 1/?-” (Example 7.1.17), so the statement follows since the 
inverse Fourier transform of the other factor is in “ Recall that V,=O(d). 


Now 
d?-"(1 + |x] dy" 92S Ca?" + |x"), 


for (n—1)/2—vSn—2 when v+0. This proves (17.5.20) and completes the 
proof of the theorem. 


We shall now prove that the logarithm can be dropped in (17.5.16). To 
prepare for a refinement in Chapter XXIX we shall in fact prove somewhat 
more: 

Corollary 17.5.11. If weC'(X) then 
(17.5.21) lim A"! C(x) e(x, x, A) (x) dx — 
A> 
—A"? 2a)" C,, J (det g*(y)) FU) dy + 
+47E RYE (2m) —" C,_ Jw dS| SC fl Ydx 
x 


where dS is the Riemannian volume element in 0X and C,, is the volume of the 
unit ball in IR’; C is independent of w. 
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Proof. Let c be so small that 0X x [0,c]a(x’, 6) x(x’, 6) is a diffeomor- 
phism. If d(x)>c/2 in suppy then (17.5.21) follows from Theorem 17.5.7 so 
we may assume that d(x)<c in suppyw. By (17.5.8) and (17.5.19) we may 
replace the right-hand side of (17.5.18) by CA”, and this is a better estimate 
if d(x)?-">A®-/?, After modifying (17.5.18) in this way then, we multiply 
by A -”/2 |W (x)| and integrate over X. If n=2 the integral of the right-hand 
side is obviously bounded by the L’ norm of w. If n>2 this remains true for 
the integral over the set where d(x)A?>1. The measure of the set where 
d(x)? -">A@-Y2 is O(2"-1E2—”), and the integral of 4+ over this set is 
therefore O(A1/4-2), The integral of A'2-"/? d(x)?-" over the set where 
d(x)?-"<A"-/? but d(x)A*<1 has a bound of the same type, multiplied by 
logA if n=3. Hence the integral over the set where d(x)A*<1 is 
O(A4—2") log 1) 0 as A> co. 
It remains to show that 


(17.5.22) A? (F h(x) eo(2d(x), A) (det gi*(x))# dx — 
—4-1 AM DI202 q)i-m C,_,Jwds)+0 as Jw. 


To do so we shali use the coordinates (x’, 6). Then the Riemannian volume 
element (det g/*(x))~#dx becomes a(x’,d)dS dé where a(x’,0)=1. Thus 


WO, atx’, 3) W(x’ SC, 
and 

[@9(26, AS CA"72(1 4. 6A8)- Ot? 
by (17.5.19). We have 


c cat 
AB O(L+ 603) FY? d5=A-7 [ tL 40) "F 2 dt 0 
0 0 
for the integral is O(A#) since n2=2. We also have 


It fF 5A) OY dS = f (L427? dt=O(A-#) 


cad 
so we may replace the first integral in (17.5.22) by 
AG—Ml2 T dS { W(x’, 0) e9(26, A) dé. 
ax 0 
Now 


oO oO 
f e0(26,A)d5=4 f eo(6,A)dBé. 
0 


—O 
With the notation é’=(€,,...,&,) we have 
eo(O,A)=(2m)-"f ede, ae’, 


112 <a-& 
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so Fourier’s inversion formula gives 


§ eo(5,Aydb=Qn)i—" [ d&=(2n)'-"C,_, AY? 
ae [| <a 


This completes the proof of (17.5.22) and of Corollary 17.5.11. 


(17.5.21) with y=1 gives in particular 
(17.5.16Y |N(A)—(22)~" C,, vol(X) A"2| << CAM YI, 


We can also choose equal to 1 in a neighborhood of 0X so that the right- 
hand side of (17.5.21) is as small as we wish. If it is possible to determine 
e(x,x, 4) on compact subsets of X with an error o(A~ 1?) we shall therefore 
be able to determine N(A) too with such an error; the boundary contri- 
bution will be 


—4-1 4-209 q)1-"C _ vol(AX). 


This will be discussed in Section 29.3 after the required technical tools have 
been developed. However, we shall now discuss examples which show that 
some additional hypothesis must then be made. 


Our first example is the Laplace-Beltrami operator on the sphere S” 
<IR"*}, The discussion so far in this section has been restricted to open 
sets X <IR". This was only done for the sake of readers who might not feel 
familiar with Riemannian geometry. All arguments were local and can 
therefore be carried over to an arbitrary compact Riemannian manifold 
with boundary; we shall consider the unit sphere S”. 

In R"*? we shall use the polar coordinates x=rq@ where r>0 and weS". 
Then the Laplace operator assumes the form 


A=r-? A, +07/6r? +n/rd/ér 


outside the origin. Here A,, is the Laplace-Beltrami operator in S". If u(x)= 
r“ v(w) 1s a homogeneous function of degree p, it follows that outside the 
origin 

Au=r"-?(4,v+p(utn—1)v). 


Hence 4u=0 outside the origin if and only if v is an eigenfunction of —A,, 
with eigenvalue A=p(u+n—1). Since A assumes all values 20 when ps 
1—n, we obtain all eigenfunctions of —A,, by restricting to S” all distri- 
butions u in IR"*! which are harmonic and homogeneous of degree u<1—n 
outside the origin. (Cf. Section 3.2.) Then Au has to be a linear combination 
of the derivatives of the Dirac measure at 0, so we find that n=1—n—k 
where k is an integer 20, and that 
u(x)= } a,D*E 


laf=x 
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where E is the fundamental solution of A and a, are constants. (When k=0 
and n=1 the logarithmic potential E must be replaced by a constant. 
Otherwise there is no harmonic function homogeneous of degree yu in 
IR"**.) The Fourier transform of u is )'a,&*|é|~7 so it follows that wu is 
supported by the origin if and only if |é|? divides the polynomial }a, &. 
Let N, be the dimension of the space of homogeneous polynomials of degree 
k in n+1 variables, 


Ne (eae =k(1+0(1/k)/n!. 


We define N,=0 for k<0. Then it follows that the multiplicity of the 
eigenvalue A,=k(k+n—1) of —A, is N,—N,_. for k=0,1,.... If N(A) is the 
number of eigenvalues <A of —A,,, it follows that 


N(A,+0)-N(Q,-0)=N,-N,_, =2k"-1(1 + O(1/kK)/(n—-1)!. 
Thus 
AG—ml2(N(A, +0) -N(A,—0)) > 2/(n-1)!, ko, 


which proves that it is impossible to find an asymptotic formula for N(A) 
with continuous main term and error 0(A“—"/2), 

If X is the hemisphere where x,,,>0 then the eigenfunctions of the 
Dirichlet problem in X are the eigenfunctions in S” which are odd with 
respect to x,,,. Let N? and Ny be the dimensions of the spaces of homo- 
geneous polynomials of degree k in n+1 variables which are odd and even 
respectively with respect to x,,,. Clearly 


N= Net Nes Ne = Nea 


since the odd polynomials are products of x,,, and even polynomials. It 
follows that 


Ny Ny 2= Ne — Nea + Ne — Ny = (NG Nea) + ON —Ne_ 1): 


Now N,—N?_, is the dimension of the eigenspace of the Dirichlet problem 
with the eigenvalue 4,, so we obtain again jumps of the order of magnitude 
A“™-1/2" The conclusion is the same as for S", and now we can of course 
realize X by stereographic projection as a ball in R" with a Riemannian 
metric conformal to the Euclidean one. 


Before embarking on the proof of Theorem 17.5.9 we need two additions 
to Lemma 17.5.4. The first is the finite propagation speed for solutions of 
the mixed Dirichlet-Cauchy problem. 


Lemma 17.5.12. Let ve C®([0, 00) x X) be a solution of the mixed problem 
(67/6t?7+P)v=h in [0,0)xX; 
v=0 in [0,0)x@X; v=v, and dv/ét=v, when t=0. 
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Let s(x, y) be the geodesic distance from x to y and set 


y(x) =min(min {t+ s(x, y); (¢, yesupp 4}, 
min {s(x, y); yesupp vy Usupp 0; }). 
Then v(t, x)=0 if t< w(x). 


Proof. By the triangle inequality for s we have 
Wx) -WO)I Ss, y); x, yeX. 


For 0<e<1 we can approximate (1—«)p by w,eC?(X) so that Yow 
uniformly in X when > 0, w, Sy, and |w,(x) —v,(y)| S$(1 —4.2) s(x, y), that is, 


¥ g!*(x) OW, (x)/Ox, OW, (x)/Ox,S1—Fe. 


Thus the surfaces S,={(t,x); t=w,(x)+a, xe€X} are non-characteristic, and 
h, vy and v, vanish below S,. Let A be the supremum of all a0 such that 
v=0 below S,. If A<0 then it follows from the uniqueness theorem for the 
Cauchy problem (Theorem 23.2.7) or the Dirichlet-Cauchy mixed problem 
(Theorem 24.1.4) that v=0 in a neighborhood of S,. (The proofs are just 
slight variations of the energy identity used to prove Lemma17.5.4 but 
would not be convenient to give here.) Hence A must be 0 so v=0 when 
t<w,(x). When e—0 the lemma follows. 

In particular we note that if feC$(X) and s(z,y)<d(y) when zesupp/, 
then s(x, y)<d(y)+t in supp(cos(tP) f), for the triangle inequality gives 
s(x, y)Sw(x)+d(y) for the function in Lemma17.5.12. When t<3d(y) we 
obtain s(x, y)<4d(y). Thus we can study cos(t/P) f then by blowing up a 
neighborhood of y with diameter proportional to d{y) to a fixed size, as we 
did in the proof of Proposition 17.4.4. Before doing so we must prove 
another supplement to Lemmai7.5.4 where parameter dependence is al- 
lowed. 

By P(x,D) we shall denote an elliptic operator of the same form as 
before except that it now depends on a parameter zeZ where Z is a 
compact convex body in IR” for some N. We assume that the coefficients of 
P are in C®(X x Z) and that (17.5.2) is valid uniformly for all P. 


Lemma 17.5.13. Let v,eC([0, T] x X x Z) be a solution of the mixed problem 
(17.5.10) (P+67/dt?)v,=h, in [0,T]xXxZ; 
v,=0 in [0,T]x@XxZ; v,=dv,/dt=0 if t=0. 


Assume that d/h,/ét'=0 when t=0 if j<k. Then it follows that with sup- 
remum taken over [0,T]x X xZ 


(17.5.11) Y sup|Dt, ,0,(6x)ISC > sup|D%, ,h, (t,x). 
lel sk 


t,x,2°"2 
jalt+on<k 
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Proof. Let the sum in the right-hand side be equal to 1. Then it follows from 
(17.5.11) that we have a uniform bound for |Df*'~/v,(t,.)\y if 7Sk+1. 
Differentiating the differential equation (67/6t? + P)v,=h, with respect to z, 
we obtain 
(67/6t? + P) 6v/6z, =0h,/dz, —OP/6z,v,=H. 

We have a bound for ||Di-’~/H||,, if 7Sk—1 and conclude using (17.5.11) 
again that there is a uniform bound for ||Df~/6v/0z, ||) if 7k. Continuing 
in this way we obtain uniform bounds for ||Dk*+'~!*!~J0%0,(t,.)l|() when jSk 
+1 —Jal, |«|<k, and the lemma is therefore a consequence of Lemma 17.5.2. 
The proof is complete. 


Remark, In this lemma it is not necessary to assume that X is bounded; it 
suffices to assume that all v, vanish outside [0,T]xK for some fixed 
compact set Kc X. 


Proof of Theorem 17.5.9. Let Ky be a compact part of 6X as in Proposi- 
tion 17.4.4, defined by x,=0 in local coordinates. Let zeX be close to Ky. 
From (17.4.15)’ we can obtain a parametrix for P when y is close to I 
=(0,...,0,1). In fact, we can first pull (17.4.15) back by the inverse of the 
map 

(t,x, y) ++ Ont, (y', 0) +ynx5y), 
that is 

(t,x,y) > (t/yn, (% — 0, 0))/Yns ¥)s 


to construct a parametrix for 07/dt?+P with pole at (0,y). Then the pull- 
back by the map 


(t, x, y)>(z,, t,(2’, 0) +z, x,(2z’,0) +z, y) 
gives a parametrix for 0?/dt?+ P with pole at (0, y). The composition of the 
maps is 
(t, X, y+ (£/Yqs(x —(Y', 0))/Y,,5(2', 0) + Z, ¥)- 
The conclusion is that with the notation y, =(z’,0)+z, y we have 
Vq(07/8t? + B(x, D)) EM (t/y,,.(% —(Y',0))/Yns Ve) 
= (det g/*(y,))* 59 yYn*) + Rylt/Yns(% —('O)/Yns Ve) 


when y is close to I, |t}<4 and s”*(x, y)<4. We can now argue as in the 
proof of Theorem 17.5.5, but with 


INN 
K,(t, x, ys Z)=Zhdm(X,,V2sZ_ t) 
0 . 
5 ON /Ynr& —(Y', OW Yue Vz) (det g*(y,))* Yn". 
For feC?(R") with support close to IT we have 
v,(t,x)=J Ky(t,x, y,z)D* f(y)dyeC, 
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the Cauchy and Dirichlet data are 0, and 
h,(t,x)= (6? /dt? + P,)u,(t, x) 


has N —n—2-jf| uniformly bounded derivatives with respect to x,t,z when 
Wfilz:<1 and |t}<3. By Lemma17.5.12 we have s*(x,7)<% in the support. 
Thus we can apply Lemma 17.5.13, by the remark following its proof, and 
conclude as in the proof of Theorem 17.5.5 that for z in a neighborhood of 
Ky 

Ky (t,x, y,z)eCX-"-* for x and y close to I. 


In particular the restriction to x=y=TJ] has this property which means that 
“—N nN s 
z,dm(z,z,z,t)—))(U,(z) 27” E,(t, 0) —V,(H, 2) E,(t, 2))(det g/*(z))~# 
0 


is in CN-"-* when 0<t<3 and z is in a neighborhood of K in X; all 
derivatives have uniform bounds. 

Let M be the function corresponding to m when X is replaced by a 
neighborhood Y of X and the coefficients of P are smoothly extended to Y. 
Then we know from Theorem 17.5.5 that 


N 
iM, z,t)—)" U,(z) E,(t, 0)(det g**(z))" #eCX "4 
0 
for small t, hence 
“ay N. : 
zn dM(z,z,z,t)—)) U,(z) 2,” E,(t, 0) (det g/*(z))~* 
10) 


is in C’~"-*, If we put 
a 
I,(z,0=t"dM(z,z,t) 


then (i) of Theorem 17.5.9 holds and, with I, defined by (17.5.17), 
N . 
t7"I,(z', 1/t,tz,) —)) VI, 2) E,(t, 2)(det g/*(z))-* 
oO 


has bounded derivatives of order N—n—4 when t<3. It follows from 
Lemma 17.5.12 that t22 in the support. Now introduce new variables 6 
= 1/te(4, 1) and s=tz,. Then we have t=1/0 and z,=s 0 which shows that 


N 
I,(z’, 0,8) —)) V(1,(2’,s 6)) 0-7” E,(1,2.6)(det g*(z’, 5 8))-* 
0 


has bounded derivatives with respect to 2’,0,s of order N—n—4. This 
proves condition (ii) in Theorem 17.5.9 when 924. The same proof is appli- 
cable when 626 for some fixed 5>0, but the proof for small @ requires 
another argument. 
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I ,(x',t6,t) is a C® function equal to E,(1,0)(det g/*(x’,0))-* when t=O. 
Thus the remaining statement’in the theorem is that t"dm(x,x,t) when x, 
=t6 and 0<0@<? is a C® function of (x’,6,t) for small t, and equal to 
(Eo (1,0) — Eo (1, 26)) (det g’*(x’,0))-* when t=0. For the proof we shall study 
the kernel 

aN 
K(t,x, y, Z)= Zz, dm(X,, Vz5Z, t); 


no approximation is attempted now. If feC7(R”) has support where 
s*(x, IT) <2, say, and I’ norm <1, then 


(17.5.23) v,(t,x)= | K(t, x, y,z) D® f(y)dy 


is a C© function with (070t? + P,)v,(t,x) = 0, the Dirichlet data are 0 and v, = 
D®f, Ov, /Ot = 0 when t = 0. When t < 2 we have s*(x, IT) < 4 in the support 
by Lemma 17.5.12. We wish to prove uniform bounds for all derivatives 
with respect to t,x,z when t=1 and s”*(x,y)<# when yesuppf. Assuming 
for a moment that this has been done, we conclude as in the proof of 
Theorem 17.5.5 that KeC® and, taking t=1, x’=y'=z', x,=y,=0<#, that 


zt dm(x,x,Z,),  Xx=(z',@z,) 
has uniformly bounded derivatives with respect to z’,6,z, when 0<0<? and 
z, is small. When z,=0 we obtain 
K(t,x, y, z)=(Eo(t, 5°(x, y)) E(t, s”7(x, y))(det g*(z))“# 


and the parenthesis reduces to (Eo(1,0)—E,(1,26)) for the preceding choice 
of t,x,y. Changing notation so that z, is called t we shall therefore have 
proved the theorem. 

It is easy to see that with v, defined by (17.5.23) and any k the map (t, z) 
r+v,(t,.) is k times differentiable for t<2 with values in 9’, and that 
derivatives of order <k are uniformly bounded in 9”, if p>|B|+k+n/2. To 
do so we observe that P is self adjoint with respect to a C®” density Y,, 


hence 
(v,(t,+), Py, + DF, AGz(t, 0, -)/As)x, = 0 
if ©,(t,s,x) is the solution of the mixed problem 
(8 /As* + P,)®,(t,s,x) = 05 
@,(t,s,x)=0, if x € OX; 0G,(t,s,x)/Os = —d(x) 
and @,(t,s,x)=0 if x EX, s=t. 


®,(t,s,x) = (sin(t — s)/A)/VF) Hx). 


It follows at once from the spectral theorem, (17.5.4) and Lemma 17.5.2, that 
we have uniform estimates for the (x,#) derivatives of ®, of order <p+1 
—n/2 if @ is bounded in C§. As in the proof of Lemma 17.5.13 we obtain by 
successive differentiations with respect to z that this is true for all (x,t, z) 
derivatives of order <p —n/2 also. If p>|B|+k-+n/2 we conclude that v, has 


Thus 
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the stated differentiability properties. The proof will therefore be completed 
by the following lemma. 


Lemma 17.5.14. Let v,(t,x) be a solution of 
(67/0t? + P) v,(t, x) =0 


when |t|<3 which is a C® function of (z,t,x) when z,>0 and z is close to a 
compact set K,< {zeR"; z,=0}. Assume that 

(i) v,(t,x)=0 when x, =0 (Dirichlet condition) 

(ii) s?(y, (z’,0))<% when (0, y)esupp v, (Cauchy data condition) 

(iii) for every a one can find p such that D%v_(t,.) belongs to a fixed 
bounded subset of Z’"(R"). 

Then there are fixed bounds for the derivatives with respect to z,t,x of 
v,(t, x) when z is in a neighborhood K' of Ky, 1St<2 and s*(x,(z’,0))<2. 


Note that when z,=0 this is quite clear since v, is equal to the solution 
of a constant coefficient Cauchy problem with data satisfying (ii) together 
with the reflection in the boundary plane x,=0. There can be no singulari- 
ties in the considered set then since it cannot be reached by light rays from 
the Cauchy data. The proof of the lemma will be given in Section 24.7 after 
a systematic study of the propagation of singularities of solutions to the 
mixed problem. It will show that the constant coefficient situation remains 
true for small perturbations. 


Notes 


As pointed out in the notes to Chapter XIII the methods used in Sections 
17.1 and 17.3 have a long history. For the elliptic case we might add 
references to Agmon-Douglis-Nirenberg [1,2], Agmon [1,5], Browder [1], 
Garding [2], Lions and Magenes [1], Schechter [1] although this list is far 
from complete. Lemma 17.1.5 comes from Friedrichs [2]. Uniqueness theo- 
rems such as Theorem 17.2.1 were first proved by Carleman [1] in the two 
dimensional case. It was he who introduced the idea of using norms con- 
taining powers of a weight function. It has dominated all later work in the 
field. For second order operators in several variables Miiller [1], Heinz [1], 
Aronszajn [2], Cordes [1], Aronszajn-Krzywcki-Szarski [1] and Agmon [4] 
proved Theorem 17.2.6 in increasing generality for real a,,. Alinhac and 
Baouendi [1] showed that it is then only necessary to assume the coef- 
ficients real at the distinguished point. Counterexamples due to Alinhac [2] 
show essentially that this condition is necessary and also that such strong 
uniqueness theorems do not hold for operators of higher order. We have 
here followed the exposition in Hérmander [41] where weaker conditions 
on the lower order terms are also discussed. (Jerison and Kenig [1] have 
shown that they are not optimal.) Calderén [1] was the first to prove 
general uniqueness theorems such as Theorem 17.2.1 for operators of higher 
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order in more than two variables. Here we have just included his result in 
the elliptic case assuming only that the coefficients are Lipschitz continuous. 
PliS [3] has proved that this condition cannot be replaced by any weaker 
Hélder condition. For the background of Theorem17.2.8 we refer to the 
notes of Chapter XIV. 

The parametrix construction of Hadamard discussed in Section 17.4 is 
essentially taken from Hadamard [1] although the arguments used here in 
some ways are closer to those of M. Riesz [1]. Seely [3] studied a parametrix 
E(t,x, y) with three terms for the mixed Dirichlet problem for the wave 
equation in IR‘ defined when the square of the distance between x and y is 
small compared to the boundary distance of y. He applied it to prove 
Corollary 17.5.1 for the Laplace operator in XcR%. In Seeley [4] the 
results were extended to higher dimensions, and Pham The Lai [1] did so 
too about the same time. In the latter paper it is evident that it is the 
Hadamard construction which is being used, and it is clear that it is not 
essential to start from the constant coefficient Laplace operator. (Some 
estimates seem incorrect but the end results are valid.) 

Carleman [3] found a way to determine the asymptotic behavior of the 
spectral function of a second order elliptic operator with Dirichlet boundary 
condition. (The corresponding results on the eigenvalues go back to Weyl 
[4].) His idea was to study the kernel of the resolvent and then apply a 
Tauberian theorem; this approach was extended to general elliptic operators 
by Garding [7]. Minakshisundaram and Pleijel [1] showed that one can 
study the Laplace transform of the spectral function as well, for this is 
related to the Green’s function of a heat operator. These methods do not 
give precise error bounds, but Levitan [1,2] and Avakumovié [1] disco- 
vered that taking the cosine transform one obtains optimal result result by 
means of the properties of the corresponding wave equation. Thus Theo- 
rem 17.5.7 is due to Avakumovié [1] but our proof follows to some extent 
H6rmander [22] where the results were extended to higher order operators 
(See Chapter XXIX.) As mentioned above, precise results on the spectral 
function close to the boundary were first obtained by Seeley [3, 4] and 
Pham The Lai [1]. They only considered the Laplace operator. Ivrii [3] 
showed by a quite different method that the asymptotic formula for the 
eigenvalues is valid without this restriction, with the error term given by 
Seeley and Pham The Lai. He also showed that there is a better asymptotic 
formula with a second term when there are not too many closed multiply 
reflected geodesics. The proof was greatly simplified by Melrose [7] who 
derived Ivrii’s result from the basic facts on propagation of singularities. We 
have combined these ideas in Section 17.5 to give improved estimates for the 
spectral function near the boundary. The Hadamard type of construction 
from Seeley [3,4] and Pham The Lai [1] is only used in the simplest 
Situation to approximate E(t,x, y) when t is less than a constant times the 
distance from y to the boundary. We have presented the arguments in 
Section17.5 so that they prepare for the full proof of Ivrii’s results in 
Section 29.3. 
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Summary 


There is no major difficulty in extending the study of second order elliptic 
operators in Sections 17.1 and 17.3 to higher order operators. From Theo- 
rem 7.1.22 we know that an elliptic operator P(D) with constant coefficients 
has a parametrix E of the form 


Ef(x)=(Q2n) "fel a(Q) fds, feF, 


where a(€)=1/P() for large |é|. As in Chapter XIII an elliptic operator 
P(x, D) with variable coefficients (cf. Section 8.3) can then be regarded near a 
point xg as a perturbation of the constant coefficient operator P(x), D) 
obtained by freezing the coefficients at xy. The proof of Theorem 13.3.3 
gives a local fundamental solution E of P(x, D) as a norm convergent series. 
However, the smoothness of the terms in the series does not grow in 
general, so it is not suitable for a precise description of the singularities of 
E. The situation can be improved by taking as a first approximation for E 
the operator A defined by 


(18.1) Af (x)=(2n)-"fek= a(x, A) flOds, fe, 


where a(x,é)=1/P(x,é) for large |é|. This means that we define Af(x) 
=E, f(x) where E, is a parametrix for P with coefficients frozen at x. 
Differentiation under the integral sign in (18.1) gives 


P(x, D) Af (x)=(2n)-" fi P(x,E+D,)a(x, 2) f(Odé, fe. 
Here 
P(x, € + D,) a(x, €) = P(x, a(x, 6)+ Y P(x, 6) Dza(x, C)/a! 


=1+0(1/¢)) 


which suggests that P(x,D)A = 1+R where / is the identity operator and R 
is an operator improving differentiability by one unit. A right parametrix 
should then be given by A(I + R)~'=A—AR + AR? —... where the smooth- 
ness of the terms increases indefinitely. 

The preceding somewhat formal argument can actually be justified, but 
it is preferable to move the successive approximation to the construction of 
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a function a such that 
(18.2) P(x,é+D,)a(x, )=14+ d(x, 8) 
with b rapidly decreasing as €- oo. If 

P(x, E)=PA(x,€)4+ P,_ (x, €)4+... 


is a decomposition of P(x,&) in homogeneous terms with respect to €, then 
it is natural to expect a to have an asymptotic expansion 


(18.3) A(X, C)~A_m(Xs 8) +A _ 1% E) 4-1, € 700, 


where a, is homogeneous of degree k. The condition (18.2) then gives if we 
equate terms of equal homogeneity 


(18.4) Pr $) @_m(% 6) = 1, 
Frys 6) Om 1% €) + Fy 1% €) B= (> 6) 
+ P(x, €) Dy, 4_ m(% 6) =0, 


and a sequence of equations expressing P,(x,)a_,,_,(x,€) for any k>0 in 
terms of a_,,,---,4_m—n41- Since P(x,6)+0, €+0, if P is elliptic, the se- 
quence a@_,,, @_m_4>--- iS uniquely determined and gives a solution of (18.2) 
with b decreasing as rapidly as we please. (Compare the proof of Theo- 
rem 8.3.1 which is based on the same principle.) 

The preceding argument indicates that any elliptic operator has a para- 
metrix of the form (18.1) with a admitting an asymptotic expansion of the 
form (18.3). Such operators A are called pseudo-differential operators. The 
reason why they are so useful is that they form an algebra, invariant under 
passage to adjoints and change of variables, so that it can also be defined 
on a C® manifold. The correspondence between the operator A and the 
function a in (18.1), called the symbol of A, allows one to give formulas for 
these operations in terms of the symbols which are as simple as those for 
differential operators. We shall develop these facts in Section 18.1. 

The Schwartz kernel K(x, y) of a pseudo-differential operator is singular 
only at the diagonal. There it is singular essentially as a homogeneous 
function of x ~y, so application of a first order differential operator which is 
tangential to the diagonal does not make K more singular. In Section 18.2 
we study the space of distributions I(X, Y)< D(X) with the same relation to 
a C® submanifold Y of the C® manifold X. For these distributions the 
wave front set is contained in the conormal bundle N(Y) of Y; accordingly 
we call them conormal distributions for Y. They play an important role also 
in the development of the calculus of totally characteristic pseudo-differen- 
tial operators in a manifold with boundary presented in Section 18.3. In 
particular, the invariance under coordinate transformations of these oper- 
ators follows from a characterization of their kernels as conormal distri- 
butions on a modification of X x X. Section 18.3 is intended to give a solid 
framework for the study of boundary problems, in particular a discussion of 
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spaces of distributions in a manifold with boundary and their wave front 
sets. However, little use will be made of the results in the later chapters 
where we use more conventional but non-invariant techniques. The notions 
introduced in Section 18.3 seem so natural though that they are bound to 
play an increasingly important role. 

The calculus built up in Sections 18.1, 18.2 and 18.3 is based on the 
study of Gaussian convolutions in Section 7.6. In Section 18.4 we resume 
their study in preparation for the calculus of operators with general symbols 
developed in Sections 18.5 and 18.6. In addition to operators of the form 
(18.1) with general symbols we also discuss the Weyl calculus which has 
important advantages due to various symmetry properties. For example, a 
real valued symbol always gives rise to a self-adjoint operator; this is why it 
was originally introduced in quantum mechanics. The main theorems in the 
calculus are proved in Section 18.5, and Section 18.6 is devoted to estimates 
for such operators. They will be needed in parts of Chapters XXVI and 
XXVIII, but apart from that it will usually suffice to have absorbed the 
material in Section 18.1 in order to read the following chapters. 


18.1. The Basic Calculus 


When studying operators of the form (18.1) we must first specify the con- 
ditions which a must satisfy. We definitely want to accept all smooth 
functions a(x,&) which are homogeneous in € for large |é| and also their 
linear combinations. The following is a slightly wider class which is techni- 
cally more convenient: 


Definition 18.1.1. If m is a real number then S"=S™(R” x R") is the set of all 
aeC”(IR" xR") such that for all , 8 the derivative a{}(x, €) = 0204 a(x, €) has 
the bound 


(18.1.1) lax, USC, pA +1E)"-"l; x, FER” 


S™ is called the space of symbols of order m. We write S~° =(\S", S° = 
sm. 7 


Definition 18.1.1 is a global version of the special case of Definition 7.8.1 
with 6=0 and p=1. Later on we shall localize S” and also consider general 
p and 6. 

It is clear that S” is a Fréchet space with semi-norms given by the 
smallest constants which can be used in (18.1.1). One advantage of not 
insisting on homogeneity is that a(x, ¢)=y(é) is in S° if yeX The following 
proposition is mainly used in this case where it is closely related to the 
regularization which was introduced already in Section 1.3. 
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Proposition 18.1.2. Let aeS°(R" x IR") and set a,(x,é)=a(x,eé). Then a, is 
bounded in S°, OSe<1, and a, a, in S” for every m>0 when ¢-0. 


Proof. Since a,eS° the statement follows if we show that for OSm<1 
(1+ [Et ag a8(a,(x, 2) —a(x, OSC, pe", OSeS1. 
When «=0 this follows since by Taylor’s formula 


lag)(x, £6) —a,9(x, 0) S Cglecl™. 
When «+0 we just have to use that 


(1+ 1g pitom( t leg lela <1 + elem? + 1Epyietom sh. 


When working with the spaces S” it is useful to keep in mind that 
(18.1.2) aeS™=>a@es"-ll; aes" and beS™ = abeS™*™. 


The proofs are obvious and show for cxample the continuity of the bilinear 
map (a,b) +> ab. 

We shall now prove a simple result which gives a precise meaning to 
asymptotic sums like (18.3). 


Proposition 18.1.3. Let a,eS"’, j=0,1,... and assume that m;— — 00 as j— 00. 
Set m,=max,>,m,. Then one can find aeS™o such that suppac | suppa, 
and for every k 
(18.1.3) a—¥ajeS™. 

j<k 
The function a is uniquely determined modulo S~” and has the same property 
relative to any rearrangement of the series )\a,; we write 


a~)\a;. 


Proof. The uniqueness follows immediately from (18.1.3) and so does the 
invariance under rearrangements. To prove the existence we choose yeC 
equal to 1 in a neighborhood of 0. By Proposition 18.1.2 we can find a 
positive sequence ¢, converging to 0 so rapidly that 


[aga (1L —x(e,E) ax, El < 2-4 + 1Eyrrt— Ml, joel + 1B) Si, 


for 1—yz(e.) 0 in S' when e-0. Set A j. )=(1 —x(€;6)) a,(x, €). Then the 
sum a=)’ A, is locally finite, hence ae C®. Given «, B,k we can choose N so 
large that N2|a|+|8| and m,+1<m,. Then we obtain 


lagORal 2)— FF Ajo SS + Eye 
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Since a,—A,eS-® and A,eS™ when j2k, it follows that 


(18.1.3 | a8 (a(x, 2) — Y aj(x, YS Copal +E) HL 
i<k 
This is the explicit meaning of (18.1.3), so the proof is complete. 


When one has a good candidate for a, the verification of (18.1.3) is often 
simplified by the following observation which shows that little attention has 
to be paid to the derivatives. 


Proposition 18.1.4. Let a,eS", j=0,1,... and assume that m,;-»>—0co when 
joao. Let aeC*(0R" xR") and assume that for all «,B we have for some C 
and yp depending on « and B 


(18.1.4) a(x, ISCU+IEDs x, EER" 

If there is a sequence p,— — 00 such that 

(18.1.3)" a(x, 6) — Yalx, d1<C,0 +10)" 
j<k 


it follows that aeS", m=sup m,, and that a~)a,. 


Proof. Subtraction from a of some AéS” with A~yY a; reduces the proof to 
the case where all a, are equal to 0. The hypothesis is then that a is rapidly 
decreasing when > 00 and that (18.1.4) is valid. We have to show that all 
derivatives of a are rapidly decreasing when é— 00. It suffices to do so for 
the first order derivatives and iterate the conclusion. If 4 is a unit vector we 
obtain using Taylor’s formula and (18.1.4) that 


a(x, € + en) —a(x, €) —<a,(x, 6),en>|SCe(1+lél", xeK, 0<e<1, 
for some C, pw. Hence 
KXaz(x, 6), a>] S Ce(1 +|6])" + la(x, ¢) — a(x, ¢ + en)l/e, 


which gives |a,(x, €)|<C’(1+|é|)"~™ if we take e=(1+|&|)~%. The derivatives 
with respect to x can be discussed in the same way, and this completes the 
proof. 


Definition 18.1.5. If me and h is the reciprocal of a positive integer, the set 
of all ae S®*" such that 


a(x, 2)~S.a(x,2) 
0 


where a, is homogeneous of degree m—jh when |¢|>1, will be called 
polyhomogeneous of degree m and step h. We write aeS™%. When h=1 the 
step size will be omitted. 


The homogeneity in the definition means that 


afx,té)=t"- "a(x, €);  |€[>1, t>1. 
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This implies homogeneity of degree m—jh—|c| for a%, so a, is automati- 
cally in S**"~J" if ajeC® and vanishes for large x. That ajeC® excludes of 
course in general that a, is homogeneous for all €+0, but since the formula 
a~}a;, is only a condition for large |é| and the proof of Proposition 18.1.3 
cuts away singularities of a, for small |€| we shall use this notation even if a, 
is just in C® when €+0 and homogeneous there. 

The symbols in Sf,, are those to which we were led in the summary 
apart from the fact that the discussion there was local in the x variable. We 
shall later on localize S” in the x variables and Sometimes in the ¢ variables 
also. However, before doing so we shall discuss pseudo-differential operators 
in R"” with symbols in S”, for the results are then stronger than in the local 
case and the proofs are cleaner. 


Theorem 18.1.6. If aeS” and ue, then 
(18.1.5) a(x, D) u(x) =(2n)~" f ef a(x, €) a(f) dé 


defines a function a(x,D)ueY, and the bilinear map (a,u) ~ a(x, D)u is con- 
tinuous. The commutators with D, and multiplication by x, are 


(18.1.6) [a(x, D), Dj] = iag)(x, D); 
[a(x, D), x;]= —ia(x, D). 
One calls a(x, D) a pseudo-differential operator of order m. 
The notation (18.1.5) is justified by the fact that if a is a polynomial in & 
then a(x, D) is obtained from a(x,é) by replacing € with D= —id/dx put to 


the right of the coefficients. This follows from Fourier’s inversion formula. 
Sometimes we write Op a instead of a(x, D). 


Proof. Since te it is clear that (18.1.5) defines a continuous function with 
|a(x, Dyu| S(2m)~" J (1 +161)" |a(2)| dg sup a(x, C1 +1E)-™. 
The first relation (18.1.6) states that 
D,a(x, D) u(x) = a(x, D) D u(x) —ia.p(x, D) u(x). 


It follows by differentiation of (18.1.5) under the integral sign since ¢,i(¢) is 
the Fourier transform of D,u. The Fourier transform of x,;u(x) is —D,ii(g) so 
an integration by parts gives the second relation (18.1.6), 


a(x, D)(x;u) =x;a(x, D)u —ia (x, D) u. 
Repeated use of (18.1.6) shows that x*D*a(x, D)u is a linear combination of 
al) (x, D)x* D¥'u; a +a”=a, p+p" =f. 


Hence x*D*a(x, D)u is bounded by the product of a semi-norm of u in £ 
and a semi-norm of a in S”. The proof is complete. 
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Remark. If we introduce the definition of # in (18.1.5), it follows that the 
Schwartz kernel K of a(x, D) is given by 


(18.1.7) K(x, y)=(2n)-" f &*-” a(x, dE, 


which exists as an oscillatory integral (see section 7.8). Alternatively we can 
interpret (18.1.7) as d(x, y—x)/(2x)" where 4 is the Fourier transform of 
a(x, &) with respect to the & variable, defined by an obvious modification of 
Definition 7.1.9. Hence Fourier’s inversion formula gives 


(18.1.8) a(x, =f K(x, x yet dy 


which should again be read as the Fourier transform of K(x,x—y) with 
respect to y. The formulas (18.1.7) and (18.1.8) establish a bijection ao K 
between distributions in S’(R2"). Now a variant of Schwartz’ kernel theo- 
rem (Theorem 5.2.1), which we leave for the reader to verify, states that the 
maps with kernel in ¥'(IR"*") are precisely the continuous linear maps 
from S(IR™) to “’(R"). For any ae¥'(IR2") we can thus interpret (18.1.5) 
as a continuous map a(x,D): S(R")>Y'(R"). The meaning of Theo- 
rem 18.1.6 is that it maps “(IR") into itself when aeS”. Later on this will 
also be proved under much weaker hypotheses on a. 


We shall now determine the adjoint of a(x, D) with respect to the sesqui- 
linear scalar product 


(u,v)=fuddx; u, veF. 
Assume first that ae. The kernel K of a(x, D), given by (18.1.7), is then in 
f and so is the kernel K* of the adjoint, 

K*(x, y)=K(y,x)=(2m)"Jel@"”™ aly,n) dn. 
Since K*(x,x —y)=(22)""Je'” A(x —y,n)dn it follows from (18.1.8) that 
K* is the kernel of b(x, D) where be SY and 
b(x,2)=(2m)~" fel" a(x —y,n)dydy 
=(2n)" fet a(x —y,E-n)dydn 


Since the quadratic form (y,7) ++ 2(y,7) in IR™ has signature 0, determinant 
(~—1)", and is its own dual, the Fourier transform of (22)~"e7'"” is by 
Theorem 7.6.1 equal to e'<**> if §, # are the dual variables of y and n. Thus 


(18.1.9) b(x, €) =e! <P» Ps a(x, €) 


in the sense that the Fourier transform of b is equal to that of @ multiplied 
by e'** (see Section 7.6). The map at+b defined by (18.1.9) is continuous 
in ¥’, so it follows that 


(18.1.10) (a(x, D)u, v)=(u,b(x, D)v); uve; 
for every ac’ if beS’ is defined by (18.1.9). 
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Theorem 18.1.7. If acS" then b(x, €) =e? ?® a(x, LES”, 


(18.1.1)  b(x, 4) ~Yi <D,, D>) a(x, O/k! =), Of DE A(x, 6)/a!. 


(18.1.10) is valid and shows that a(x, D) can be extended to a continuous map 
from ¥' to F', as the adjoint of b(x, D). 


Proof. Choose y¢C2(IR") so that y(£)=1 when |é|<4 and y(£)=0 when 
\é|>1, and set for integers v20 


a, (x, €)=x(E/2") a(x, €), by (x, €) =e PP a, (x, 2). 
In view of Proposition 18.1.2 we have for all v, if (x, é)esuppa,, 
Jaa (x, MS Cop(A +16)" "SC, p(1 +2") 


Hence Theorem 7.6.5 gives 
(18.1.12) Ib, (x, €)— ¥ G<D,, DY a,(x, Sf !1< C, 2". 
j<k 


If |€|>2"*' then the distance from (x,é) to suppa, is at least equal to 
\é| 2" >|é|/2>(1+|é))/4 so the sum drops out and we are allowed to insert 
a factor (1+|é|)~* in the right-hand side. Given € denote by y the smallest 
integer =O with |¢|<2"*+?. Then either w=0 and |é|<4 or yw>O and 
+l <lE|<2"+?, In both cases (18.1.12) with the improvement just men- 
tioned gives 


(I8L12V — by(% 2) Yo EKDz» DY ay (xs S/F" < CU + LED. 
i< 


To estimate 


b(x, 2) byl, H=T(b, 4 106 —B, 06,8) 


we set A,(x, €)=a,, 1(x,¢) —a,(x, 6), B,(x, C)=b,, (x, €) —b,(x, ¢), and observe 
that 
[AG (x, SS Cig 27" 9? 


since 2”—!<|é|<2"*! in supp A,. Hence 
[a2 08 A,(x,2” 1S C2”. 


Since ; 
B,(x, 2” &) = e!<P= Ps?!" 4 (x, 2” €) 


we obtain from Theorem 7.6.5 where the factor ||A|| is now important 
|B, (x, 2” €) — dX (iXD,, D,>/2°Y A, (x, 2” H/j "|S Cy 2"-””, 
that is, fi 
|B, (x, ¢) ~ 2, (i<D,, D>} A(x, O/f NS Cy W—™”. 
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Let k=m+1 and sum for v=y to oo. The sum on the right-hand side is at 
most twice the first term, and (1+|&/)/2" lies between fixed bounds. Hence 
we obtain 


YB) — ¥ EXD, Dd A ODS Ce (+E) 
# i<k 


which combined with (18.1.12) proves that with a new constant C, 
(8111 |b, 2)— ECD, DEV al, HHS Cell +1) 
j<k 


Since 
BG(x, elo PP a(x, & 


a similar expansion is valid for the derivatives of b, which completes the 
proof of (18.1.11) and of the theorem. 


Remark. The proof gives (18.1.11)' with a constant C, depending only on a 
finite number of semi-norms of a in S”. (Since such statements can also be 
recovered from the qualitative statement in Theorem 18.1.7 by means of the 
closed graph theorem we shall usually omit them in what follows.) If a’ is a 
bounded sequence in S” which tends to 0 in C® it follows that 
ef Px, Ds? qr(x, £) is bounded in S" and tends to 0 in ¥’, hence in C®. 


Next we study the composition of operators. 


Theorem 18.1.8. [f a,eS, j=1,2, then as operators in f or # 


(18.1.13) a,(x, D)a,(x, D) =b(x, D) 

where beS™+™ is given by 

(18.1.14) b(x, €) =e?» a, (x, )a2(y, One yax 
and has the asymptotic expansion 

(18.1.15) b(x, 2)~ i EKD,, D,>Y 41% 20, Vw eye x/l! 


=¥ a® (x, €) Dt a,(x, &)/a!. 


Proof. Assume first that a,e% If ueX then a,(x,D)ueS and the Fourier 
transform is 


qr>(2m)-"[elF—™ arly, C)a(e)dg dy. 
Hence 
a(x, D) a(x, D) u(x)=(2n)- 2" [fe 12 a(x, nary, a(e)dé dy dn 
so (18.1.13) is valid with 
b(x, C)=(2m)-" [fet @—™—™ ay (x, m) a2(y, dy dn 
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which is equivalent to (18.1.14) as shown before the statement of Theo- 
rem 18.1.7. Now consider for arbitrary ajeS™ 


B(x, €, y, =e PP a, (x, 1) a2(y, 2). 
Since 


[0% 08 a(x, 0) ao (y, NS Cyp(l + ind H LEN 
it follows from Theorem 18.1.7 and the remark after its proof that 
|B(x, ¢, y,#) ~ 2, ED, Pad? a,(x, 1) a2(y, O)/7! | 
SC,(L + lq *(1 +16)". 


Here C, can be estimated by a finite sum of products of semi-norms of a, in 
S™ and a, in S". Since differentiations commute with e/<?»?”” we have 
more generally 


|a3 00 OF 5 (B(x, € y, n) ~ 2 EP,» D,>¥iay(x.n) ayy, O49) 
SC a.a'e,0°L thn Md + gym, 


where C, ..’,s,,’ has a similar bound. Hence b(x, €)= B(x, €,x, eS™*™ and 
the bilinear map (a,,a,)k>b is continuous from S™xS™ to S™+™. It 
remains to verify (18.1.13) in general. 

Choose yE€C> equal to 1 in a neighborhood of 0 and set 


a’ (x, €)=x(x/v) x(E/v) a,x, ). 


Since x(¢/v)a,(x,£)>a,(x,€ in S™** when v— co (Proposition 18.1.2) we 
have a}(x, D)ua,(x, D)uin # as vo, if we SX Hence 


ay(x, D) a} (x, D)ua,(x,D)a,(x,D)u nF 


If b’(x,¢) is defined by (18.1.14) with a, replaced by a} then b” is bounded in 
S™+™ and converges to b pointwise, hence in ’, so b’(x, D)u-» b(x, D)u in 
F' as v->oo. Since b’(x, D)u=aj(x, D)a}(x, D)u it follows that (18.1.13) 
holds. The proof is complete. 


Theorem 18.1.8 permits us to give a precise form to the discussion of 
inversion of elliptic operators in the summary. 


Theorem 18.1.9. Let acS" and beS-™. Then the conditions 
(i) a(x, D) b(x, D)-—IeOpS-” 
(ii) b(x, D) a(x, D) -IeOp S-° 


are equivalent, and a determines b mod S~”. (I is the identity operator Op 1.) 
They imply 


(iii) a(x, €) b(x, )-leS~! 
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which in turn implies that for some positive constants c and C 
(iv) la(x,o)l>elo if |C|>C. 
Conversely, if (iv) is fulfilled one can find beS~™ satisfying (i), (ii), (iii). 
Proof. (i) and (ii) both imply (iii) by Theorem 18.1.8. From (iii) it follows that 
|a(x, €) b(x, €) —1|<4 for |E|> C, hence 
3 <la(x, 2) b(x, Q<C' Jax, eer", 1g1 >, 

which proves (iv). From (iii) it also follows that 

a(x, D) b(x, D)=I —r(x, D), reS-!, 
We want to invert I —r(x, D) by the Neuman series, so we set 

b(x, D) r(x, D)*=b, (x, D), - b,eS~™~*. 


With b’~)'b, we obtain 
0 


a(x, D) b’(x, D) —I =a(x, D)(b'(x, D) — » b (x, D)) —r(x, D)keOp S~* 


j<k 
for every k. This proves that (i) is valid with b replaced by b’. In the same 
way we can find b’cS~™ such that (ii) is fulfilled with b replaced by b”. 
When 
a(x, D)b'(x,D)-—IeOpS~* and b"(x, D)a(x,D)—IeOpS-” 
then 
b’(x, D) —b'(x, D)=b''(x, D)(I —a(x, D)b'(x, D)) 
+ (b"(x, D) a(x, D) —1) b'(x, D) 

is in OpS~® so b’ and b” satisfy both (i) and (ii). This proves the 
equivalence of (i) and (ii) and also that a determines b mod S~®. It remains 
to prove that (iv)= (iii). The proof is reduced to the case m=0 if we 


introduce a(x, €)(1+|é|?)~"? and b(x,€)(1+|é|?)"/? instead of a and b. Then 
it is a consequence of the following: 


Lemma 18.1.10. If a,,...,a,e¢S° and FeC”(C*) then F(a,,...,a,)€S°. 


Proof. Since Rea,, Ima,eS° we may assume that a, are real valued and that 
FeC™(IR*). We have i 
OF (a)/0x;=) GF /da,a OF (a)/0é ,= >) OF /da, a, 


and a,,)€S°, aeS~*. Hence it follows by induction with respect to |«|+|B| 
that the derivatives of F(a) satisfy (18.1.1). 


vi)? 


End of Proof of Theorem 18.1.9. Assume that (iv) is fulfilled and that m=0. 
Choose FeC®%(€) so that F(z)=1/z when |z|>c. Then b=F(a)eS° and 
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a(x, €)b(x, €)=1 when |&|>C, which strengthens (iii) to 
(a1) a(x, €)b(x,€)=1 when [é|>C. 
This completes the proof. 


Theorems 18.1.7 to 18.1.9 are the core of the calculus of pseudodifferen- 
tial operators. They lead to improved continuity properties: 


Theorem 18.1.11. If aeS° then a(x, D) is bounded in I7(IR"). 


For the proof we need a classical lemma of Schur: 


Lemma 18.1.12. If K is a continuous function in R” x IR" and 


sup J |K(x,y\dxSC,  sup|K(x, yidy SC, 
y x 
then the integral operator with kernel K has norm <C in I? (IR"). 


Proof. Cauchy-Schwarz’ inequality gives 

[K u(x)? SJ1K(x, y)I u(y)? dy [IK(G, yl dy. 
If the last integral is estimated by C, an integration with respect to x gives 

J\K u(x)? dx SC J |K(x, y)| |u(y)? dx dy s C? | u(y)? dy. 
Proof of Theorem 18.1.11. Assume first that aeS~"~*. Then the kernel K of 
a(x, D) is continuous and 
IK(x, y)| S(2m)-"Jla(x, QdEScC. 

Now (x — y)* K(x, y) is the kernel of the commutator 

[x,,[x,,...,[%,,@(x, D)]]] =i! a(x, D), 
by (18.1.6), so this is also a bounded function. Hence 

(1+|x—yl"**|K(x, WIS, 


and the L’ continuity of a(x, D) follows from Lemma 18.1.12. 
Next we prove by induction that a(x, D) is L? continuous if aeS* and k< 
—1. To do so we form for ue SY 


lla(x, D) ull? =(a(x, D) u, a(x, D)u) =(b(x, D)u, u) 


where b(x, D)=a*(x, D) a(x, D)e Op S?*. The continuity of a(x, D) is therefore 
a consequence of that of b(x, D), 
\|a(x, D) ul]? < ||b(x, D) ul] lll] S]b(x, D)| lull. 


From the first part of the proof the continuity of a(x, D) for all aeS* now 
follows successively for k< —(n+1)/2, kS —(n+1)/4,..., hence for k< —1 
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after a finite number of steps. Assume now that aeS° and choose 
M>2sup|a(x, €)|?. Then 

c(x, )=(M —|a(x, 2)|’)#eS° 


by Lemma 18.1.10 since M/2<M —|a(x, )|? and we can choose FeC®(IR) 
with F(t)=1? when t= M/2. Now Theorems 18.1.7 and 18.1.8 show that 


c(x, D)* c(x, D)= M —a(x, D)* a(x, D)+r(x, D), 
where reS~'. Hence 
\|a(x, D) ull? SM |\uli? + (7%, D)u, u) 
which completes the proof since r is already known to be I’ continuous. 
It follows from the proof that the norm of a(x, D) can be estimated by a 


semi-norm of a in S°. There is a very simple proof of L? continuity which 
requires no smoothness at all in € but instead some decay as x— oo. 


Theorem 18.1.11’. Let a(x, be a measurable function which is n+1 times 
continuously differentiable with respect to x for fixed €. If 


¥ f{Dta(x,|dxSM, €eRR’, 


la] Sn+1 


for some M<oo, it follows that a(x,D) is bounded in I?(IR") with norm 
SCM. 


Proof. If tie C¢ the Fourier transform of a(x, D)u is 
nr fAln—E, 2) ae) ae 
A(n, €)=(2n)-" J a(x, Ee 7 dx. 


where 


By hypothesis 
(1+|q)"** AQ, YISC,M 
which implies that 
f\A@—2.OldnSCM, JlA(n—E,S1déSCM. 


In view of Lemma 18.1.12 it follows that the 1? norm of the Fourier 
transform of a(x, D)u is at most CM ||@|, which completes the proof. 


Let us now reconsider the spaces H,, introduced in Definition 7.9.1: we 
have ueH,, ifueS and teLi,,, 


[lu gs) = (2 m)-" f |A(S)I7 (1 +1 |?) dE)? < 0. 
If E(Q=(1+1é}?)"", then E,¢S*° and the Fourier transform of E,(D)u is 
E,(é)u(é), so the definition means exactly that E,(D)ueL’?. This gives a 
precise meaning to the idea that H,, consists of the distributions with 
derivatives of order s in L’, and we have 
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Theorem 18.1.13. If aeS™ then a(x, D) is a continuous operator from H,,, to 
Fg _m for every s. 


Proof. If ueH,, then v=E,(D) uel? and 
E,_,,(D) a(x, D) u=E,_,,(D)a(x, D) E_,(D) vel? 
by Theorem 18.1.11 since E,_,,(D)a(x, D) E_,(D)eOp S°. 


Remark. a(x, D) is also continuous ?H,, >?H,,_ ») by Corollary B.1.6. 


The proof of continuity in Theorem 18.1.11 was based on estimating 
a(x, D)* a(x, D) from above. We shall now prove a stronger one sided es- 
timate which is often important. It is usually called the sharp Garding 
inequality. Various improvements of it will be given later on, and a shorter 
proof will be possible when we have developed stronger techniques. How- 
ever, we give a direct and in principle elementary proof here for the benefit 
of readers who do not wish to go through Sections 18.4 to 18.6. 


Theorem 18.1.14. If acS?"*+ and Rea=O0 then 

(18.1.16) Re(a(x,D)u,u)= —C luli, ueS. 

Proof. (18.1.16) follows from Theorem 18.1.13 if aeS?™. Since 
(a(x, D) + a(x, D)*)/2 —(Rea)(x, D)eOp S?™ 


it is therefore sufficient to prove (18.1.16) with a replaced by Rea. Thus we 
assume a=0O in what follows. To prove (18.1.16) we shall then write a(x, D) 
as a superposition of positive operators with an error of order 2m. We start 
by choosing an even function ¢¢C?(R2") with LV? norm one and define 
we by w(x, D)= (x, D)* o(x, D). Then w is even by (18.1.14) and (18.1.9), 
and we claim that 


(18.1.17) fJvy,n)dydn=1. 
For the proof we observe that if K, and Ky are the kernels of $(x, D) and 
of w(x, D) then 
(22)-" | W(x, dx dé =| K,(x,x)dx=J]|K4(x, y)? dxdy 
=(20)~" JJ lo(x, OP? dxdé 


where the last equality follows from (18.1.7) and Parseval’s formula. 
Having proved (18.1.17) we now set a=a,+a, where 


(18.1.18) a,(x, €)= Jf w(x —y) a(n), (E —n)/a(n)) aly, n) dy dn, 
q(ny=(1 +I nl?)*. 
Since (w(x, D)u,u)=|1¢(x, D)ull?=0 we obtain (W(tx, D/t)u,u)20O if we re- 
place u by u(x/t). Hence it follows that 
(W(t(x—y),(D—n)/t)u,u)20,  ueX, (y,n)eR™, 
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if u is replaced by u(x+y)e7'*™. It is therefore clear that 
(a,(x,D)u,uj20, ueF, 


so (18.1.16) will follow if we prove that a,¢S?". 

When we differentiate with respect to x, under the integral sign in 
(18.1.18), the derivative can be replaced by —0/éy,. After an integration by 
parts the net result is that a has been differentiated with respect to y. 
Differentiation of a, with respect to ¢, is more complicated since 


(0/0¢ + 0/6n ) W(x —y) a(n), (E —n)/a(n) 
=W' (x —y) a(n), (E —n)/a(n)) Fn), 
where F;(y)=q(n)~' 6q(n)/@njeS~* and 


é 
(18.119) WO, =a WEx, C/I 1 =< OW/8x> — C6, OW /0E> 


is another even function in “& Note that ff w'(x, )dxd&=0. Differentiation 
of a, with respect to ¢, therefore gives one term of the form (18.1.18) with a 
replaced by a and one where is replaced by y’ and a factor FeS~' is 
introduced. Inductively it follows that af}, =a{%}—a‘, is a finite sum of 
terms of the form 


(18.1.20) bo(x, 2)=J) wile —y) a(n), (E —n)/a(n)) b(y, n) dy dn 
—b(x, 8) [J vi, ndydn 
where w,¢Y is even and beS?™+1-'4l, The theorem will be proved if we 


show that beS" implies |b, (x, €)|S C(1+|é|"7?. 
a) When | —n|2(1 + €|)/2 we have 1+|q|<3|¢—n| and 


(L+(ep/A+in)si+i[e—n), U+lnbAt+te)si+le—al. 
The factor y, in the first integral in (18.1.20) can be estimated by 


Cy(h+]x yl a(n) +1E —nl/q(n))- 2" +16 -nl?/a(n)?)-® 


for any N. Since q(n)?<1+|n|<3|&—n| the last factor may be replaced by 
(1+|é—n|/3)~*. Hence the integrand in the first term in (18.1.20) has the 
bound 


Cyl +x — yl (6) +1 —yl/q(&))- 24-1 + epee tt-8 


when |€—y|2(1+[])/2 and N>|p|+n+1. The integral of the first factor 
with respect to y and 7 over R?" is finite and independent of (x, &), so we 
obtain the desired bound for the corresponding part of the first term in 
(18.1.20) if N is large enough. 

b) When |€ —|<(1+|€|)/2 we have 


(1+16])<20. +{6)) <3(1 +161) 
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for every 9 on the line segment between € and 4. Hence Taylor’s formula 
gives 


Boxy. =1bO.)— 2, Pee IO— Pr e/aBl 


SC(lx—yl ag) +1 —nl/a(yPr +1“? 


(Note that the scales in yw were chosen to match in this estimate.) Thus 


ff Bex, y, Em (x —y a, (E-n/aMldy dns C(1 + |élP~?. 


24e—ni<1+{8} 


It remains to evaluate the integrals 


W((x—y) a(n),(E —n)/a(n) (vy — x) (n — 2)" dy dn. 


218—nl<1+18l 


To do so we would like to replace q(y) by q(é. With the notation yj in 
(18.1.19) we have for any N when t and 1/t have a fixed bound 


IW s(tz, O/t) —W ,(z, ®) —(t-1) Wi (z, OS Cy(1 +121 + 161)" Me -1). 
Assuming still that |€ —74|<(1+[&))/2 we shall take t=q(y)/q(é). Then 
q(2)(t —1)=4(n) —4(6) =<a'(8),.n —€> + On — El/a(6))"/4(8) 
so replacing (z, 6) by ((x —y) q(é), (€ —n)/q() we conclude that 
IW (x —y) a(n), (E —n)/a(n)) —W (x —y) a(2), (€ —0)/4(2)) 
— <4) (n — SV/alE)> Wi —y) a(0), (6 —)/a(e))I 
SCy(1 +1x—yl a(S) +1 —nl/a(S)y? NA+ 1G)? 
Since 2|¢—n|>1+|¢| implies |f —ml/q(¢)>(1 +12) (24(2)) we obtain 
ee eee w(x —y) a(n), (E —n)/a(n))(y —x)°(n — "dy dq 


=4(2)"'- FIT (f win)? n*dy dn 
+ JJ ViCy, ny? 0? <n, q'(2)> dy dn +O(1+1€)~*)1. 

The first integral on the right vanishes when |a+fj=1 and the second 
vanishes when «+$=0. When |a+f|=2 we get the bound Cq(é'*!-4 
=Cq(é)*"!-?, when |a+fl=1 we get the bound Ca(é*l-ll-1 
=Cq(é)?"'-?, and when a+f=0 the integral is [[w,(y,n)dydn+ 
O((i+|é))~*). If we multiply by bi (x, é/a!B! and sum, we obtain after 
subtraction of the second integral in (18.1.20) 


Ibo SC, Yo R(x, Ad +1Ep- t+ C,0 + lee“? 
lja+p[s2 
This completes the proof. 


Remark 1. In Section22.3 we shall prove some refinements of Theo- 
rem 18.1.14. It will then be important that the preceding proof shows that if 
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0<aeS? then one can find a,eS" such that a(x, D)—a,(x, D) is non-negative 
and 


(18.1.21) lanl $15 CC » (1+ 1E))'!-* fal (x, +. +16). 
ja+p|<2 


This is a special case of the preceding estimate of by. 


Remark 2. The calculus of pseudo-differential operators works equally well if 
we allow symbols aeS” with values in ¥(B,,B,) where B, and B, are 
reflexive Banach spaces. Thus a(x,D) maps SA(IR",B,) to S(R",B,) and 
F(R’, B,) to #'(R’", B,). We just have to replace absolute values by norms 
in the arguments above. When B, and B, are Hilbert spaces the results on 
L? continuity remain valid and the proofs only require obvious modifica- 
tions. Also the sharp Garding inequality (Theorem 18.1.14) is valid if 
aeS?"*+! has values in Y(H,H) where H is a Hilbert space, and the 
function u in (18.1.16) takes its values in H. In fact, if A is a positive 
bounded operator in H then 


(W(x —y)/t,(D~—n)/t)Au,u)20; —y, nelR", t>0; 


when w is defined as in the proof of Theorem 18.1.14, for the spectral 
theorem reduces the proof to the scalar case. Thus the first part of the proof 
remains valid, and the second part requires essentially no change at all. 

The following simple consequence of Theorem18.1.14 gives Theo- 
rem 18.1.11 a more precise form which is often important. 


Theorem 18.1.15. For every bounded subset A of S° there is a constant C such 
that 


(18.1.22) Re(a(x, D)u,u)=>—Céllull?, ueF, 

if aeA, az=0, 0<d<1, and a(x,é)=0 when |E6|<1. We also have 

(18.1.23) |a(x, D)|| Ssup|a[+(Cd)*, ueF, 

if aeA, 0<6<1, and a(x, €)=0 when |é6|<1. 

Proof. Let A,={a/d;aeA and a(x,é)=0 when |€d|<1, 0<d<1}. Since 
6-'<1+€| in supp a it follows that A, is a bounded subset of S’. Hence 
(18.1.16) is valid uniformly with m=0 for all a€A,, which proves (18.1.22). 


Choose yeC™(R") with O<y<1, x(6)=1 when |é|>1 and x(é)=0 when 
\é|<4. Then we can apply (18.1.22) to 


M? (66)? —la(x, 6)? 


if aeA, M=sup|a| and a(x,é)=0 when |6é|<1 for some de(0,1). This gives 
with another C 


M? |lul|?—Re(la|?(x, D)u,u)=—Co|lull?, uel. 
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We have a*(x, D) a(x, D)=|a|*(x, D)+ b(x, D) where b(x, €) is bounded in S-! 
and vanishes when |5é|<1. Hence b(x,)/5 belongs to a bounded set in S° 
so ||b(x, D)/d|| << C’ by Theorem 18.1.11 and 


M? |\u||? — |la(x, D)ul|? = —(C+C)6 flu”. 
This completes the proof of (18.1.23). 


In the following theorem we sum up some smoothness properties of the 
kernel of a pseudo-differential operator, partly encountered already. 


Theorem 18.1.16. Let acS” and denote by Ke ¥'(IR" x R") the Schwartz kernel 
of a(x,D) defined by (18.1.7). Then KeCi(IR"xR") if m+j+n<0, and 
KeC™(R"xR"~\ A) for any m if A is the diagonal {(x,x); xeIR"}. More 
precisely 


(18.1.24) WF(K)  {(x, x, 8, — 6); x, 0ER"} 
which is the conormal bundle of A. We have 

(18.1.25) WF (a(x, D)u)c WF(u), ueS’, 
(18.1.26) sing supp a(x, D)ucsing suppu, ue”. 


If aeS~™ then a(x, D) S’<C™. 


Proof. (18.1.7) is absolutely convergent and remains so after j differentiations 
if m+j+n<0. This proves the first statement. To prove the second one we 
observe that if y, eC? then x(x) K(x, y) Wy) is the kernel of the operator 


urvrya(x,D)wu, ueZ, 


which is in OpS~~ by Theorem 18.1.8 if suppyasuppy=9. Hence the 
kernel is in C® then. The more precise result (18.1.24) follows at once from 
Theorem 8.1.9 with (x, y,@)=<x—y,9>. If uwe&’ then Theorem 8.2.13 gives 
(18.1.25) which implies (18.1.26). (It is also easy to prove (18.1.26) directly 
when ueé’, for singsupp a(x, D)ucsuppusince KeEC™(R" x R’~ A), and we 
can write u=u,+u, with u,eC>, hence a(x, D)u,eC™, and suppu, close to 
sing supp u.) If aeS~~ and ue FS’ then 


Di, a(x, D) u(x) =(2)-" a, D3(e' a(x, 2), 


for this is true when ue and the right-hand side is a continuous map from 
F' to C°. Hence a(x, D) S’ <C™. If x, weC% and y=1 in a neighborhood of 
supp yw, it follows that 


w a(x, D)u—w a(x, D)(xu)= a(x, D)(1 —yx)uec”. 
Thus WF(wW a(x, D)u) < WF(yu) which yields (18.1.25) for any ue’. 


Remark. (18.1.26) is often called the pseudo-local property while (18.1.25) is 
referred to as the microlocal property of a(x, D). 
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Before discussing the effect of a change of variables on Op S” we observe 
that if aeS” then a direct computation gives 
a(x, D) (ei? uae a(x,D+é)u;  ueS’, EIR". 
If we take u(x)=v(ex) where beC? and v(0)=1, it follows when e—0 that 
(18.1.27) a(x, D) e* =a(x, Eye? 


This gives a convenient way to recover the symbol from the operator. 


Theorem 18.1.17. Let X and X,. be open subsets of IR" and x: X>X, a 
diffeomorphism. If aeS" and the kernel of a(x,D) has compact support in 
X xX then 


(18.1.28) a,.(K(x), n) =e 1 O™ a(x, D) ef KOM, 


a,(y, €)=0 if yéX,, defines a function a,¢S™ such that the kernel of a,(x,D) 
has compact support in X,. x X,, and 


(18.1.29) (a,,(x, D) ujowx=a(x, D)(uck), ue’. 
For a, we have the asymptotic expansion 
(18.1.30) —— ag(ae(x),)~ Ya (x, %«' (x) m) De PO at}, 


where p,(y)=Kk(y)—K(x)—k(x)(y—x) vanishes of second order at x. The 
terms in the series are in S™~'l/2, 


Proof. If we show that a,eS” then (18.1.28) means precisely that (18.1.29) is 
valid when u(x)=e'<*. This proves (18.1.29) since both sides are con- 
tinuous from #’ to &(X) and linear combinations of exponential functions 
are dense in Y’. Before the proof that a,eS” and that (18.1.30) is valid we 
observe that 


(18.1.31) $4 (%, 9) = De el6PxO™) 


is a polynomial in 7 of degree <j|a|/2 with C®™ coefficients. In fact, a 
differentiation producing a factor 7, also brings out a derivative of p,(y) 
vanishing at x. If k derivatives bring out a coordinate of 7 each, we must get 
a zero term unless |a|—k2k, that is, 2k<|a|, so that there are enough 
derivatives left to remove these zeros. This shows that the terms in (18.1.30) 
are indeed in S”~'/? so that the asymptotic series is well defined. Note the 
values of the first few polynomials ¢,: 


(181.31) do=ls b=0, |al=1; 
balx.n)=Di<K(x),n>, lal =2 or 3. 


By Proposition18.1.4 we shall obtain that a,(«(x),4)eS”, hence that 
a,€S", and that (18.1.30) is valid, if we just prove estimates of the form 
18.1.3)”. In fact, differentiation of (18.1.28) gives a finite sum of monomials 
in 4 multiplied by a similar expression with another a, so (18.1.4) follows. 
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Choose ECP (X) so that o(x)=(y)=1 for (x, y) in a neighborhood of 
the support of the kernel of a(x, D). Then 
(18.1.28)  a,.(x(x), 1) = P(xpeF" a(x, D)( G(x) eh), 


which shows at once that a,eC®™. To study a, for large y we introduce the 
Fourier transform 


G(E,n) =f H(y) KOMP “1D dy, 


The differential of the phase is ‘x’(y)n—€. If |x’(y|SC and |x’(y)" "SC, 
yesupp ¢, then 


Nx'(y)n—El2e/2 if [2/22 Cal, 
I'x'(y)n—El2C7* In—"x'(y)“* Elz |nl/2C if Cle] <|n|/2. 
In both cases we can normalize the exponent by writing 
<K(y), 1) — CY, > = O(CK(y), n/w@> —<y,6/@>), = @=1E] +I] 
and obtain using Theorem 7.7.1 for any N 
P(E,mlSCy(L+1El+ln)-™ — unless |y|/2C<|é|<2C|n]. 


Now choose yeCf(R") equal to 1 when 1/2C<|é|<2C and equal to 0 
when |é|<1/4C. Then 


a(x, D) (p(x) "= 1, (x, n) +1,(%,n), 
where 
T(x, n)=(20)-" fk a(x, &) BE, 0) (1 — (Eni) dé 


decreases faster than any power of 1/(1+|y|) as 7 00 and 


I(x, n) =(co/2 2)" f ef OO —% 7 + OV) a(x, w E) x(6) b(y) dy dé. 


Here w=|n|. The integral J, is of the form studied in Theorem 7.7.7 if y—x 
is taken as a new variable instead of y, the roles of x and y are in- 
terchanged, and another parameter n/w occurs. Note that y(€)=1 near the 
critical point € ='k’(x)n/w. Hence we obtain for I, the asymptotic expansion 


ese 2, <iD,, D,/a>! ef Px) a(x, @ oi ! lye x, Ea tx! (x}q/eos 


if we observe that all derivatives of a(x,wé) can be estimated by w™ in the 
support of the integrand, and that ¢(x)=1 in a neighborhood of the support 
of a,,(«(x), n). The proof is complete. 
The simplest consequence of (18.1.30) is by (18.1.31)' that 
a,.(«(x), 0) — a(x, "k'(x) neS™*. 


In particular, if a is polyhomogeneous with principal symbol a®, that is, if a° 
is the homogeneous. term of highest degree, then a, is polyhomogeneous 
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with principal symbol a? satisfying 
ay (x(x), 4) =a? (x, 'k'(x) 1). 


The principal symbol therefore transforms as a function invariantly defined 
on the cotangent bundle, just as in the case of differential operators dis- 
cussed in Section 6.4. For the next term a! resp.a! we obtain (assuming that 
the step is 1) 


ax (x(x), n)=a"(x,"x'(x)q)+ Ya (x,'x'(x) n) DECI K(x), /a!. 
Ja|=2 


A simpler transformation law is valid for the subprincipal symbol defined by 
(18.1.32) a'(x, £)=a'(x, €)+i/2 ¥ aBM(x, €). 
To compute it for a, we use that 
ay(y,n)=a° (x, 'k'(x) n), y= K(x), 
where (‘x’(x) q),=0<y,n>/0x,, which gives 
Y ae} =D Ox4,/0y; 0/Ox, Ya Oy fx, 
= ads 0x,/dy; Oy;/Ox1+ Y, Ox,/dyj(a™ 6? yj/Ox1 Ox, 
+ ¥ a") 6? Cy, 1>/Ox, OXp Oy ;/OX)). 


If J=det(dy,/dx,) then J(dx,/éy;) is the cofactor matrix of the matrix 
(dy,/0x,) so J~* 6J/0x, = 4? yj/Ox, Ox, 6x,/dy;. Hence 


(18.1.33)  ags(xc(x), m)=a*8(x, "x’(x) q) —3 Y) a(x, 'x'(x) 0) (Di J)/J. 


This may not look much simpler at first sight, but we note that a’ is at 
least invariantly defined at the points in the cotangent bundle where a® 
vanishes of second order. It is also an invariant under measure preserving 
changes of variables, and we shall see later that a slight modification of our 
point of view gives a complete invariant. A more conceptual motivation for 
the notion of subprincipal symbol will be given in Section 18.5. 


Our next aim is to define pseudo-differential operators on a manifold. 
First we must discuss symbols. If X CIR” is open we define S7,(X x IR") as 
the set of all aeC™(X xIR") such that (x)a(x, é)eS"(R" xR") for every 
peCo(X). This means that for every compact set Kc X we can find C,x 
such that 


(18.1.1) lax, WS Capel +1E)"-"s xe K, EER" 


By means of a partition of unity we can immediately extend Proposi- 
tion 18.1.3 to this local case. 

More generally, let [' cIR"x IR" be open and conic with respect to the 
second variable, that is, (x,é)el => (x,té)el if t>0. Then we define S7% (7) 
as the set of all aeC*(I) such that for every compact set K’cI the 
estimate (18.1.1) is valid in {(x,t&; (x,4eK’, t21}. Taking K’= 
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K x {€; |E| 1} we find that this agrees with the earlier definition when [= 
X xR". We shall usually simplify notation by writing S”(I°) instead of S%,(1) 
since there can be no ambiguity unless [=IR" x IR". Symbols behave well 
under a change of variables: 


Lemma 18.1.18. Let X, and X, be open sets in R” and let : X,—>X, and 
@: X,—>GL(n,R) (the group of invertible nxn matrices) be C® maps. If [, 
<X,xR" and I,-X,xR" are open and conic, (o(x), ®{x)é)eD, when 
(x, el, then ; 

a, (x, €)=a,(P(x), B(x) ¢) 
is in S"(Ij) for every a,eS"(L). 
Proof. K,={($(x), ®(x) €);(x,é)EK,} is a compact set in I, if K, is a 


compact set in [,. Since |@(x)é|/|E| is bounded from above and below when 
(x, eK, and 


aP=>' a ®, ;, 81) =D, Fa) 9, /Ox + aS? O®,,/0x;&, 


the required estimates (18.1.1) follow inductively. 


If X CR” is open and aeS”(X xR”), then an operator 
a(x, D): S'(R")> D(X) 
is still defined by (18.1.5), and it restricts to an operator &’(X)—> 9'(X) or 
CP(xX)— C™(X). In fact, if pe CP(X) then P(x) a(x, G)eS” so the correspond- 
ing operator has already been discussed. It follows also that a(x,D) is a 
continuous map H,,. > H(o°_,)(X). However, there is some lack of symmetry 
between right and left multiplication which must be removed before one can 
take adjoints. The situation is clarified by the following 


Proposition 18.1.19. If A: C$(X)— C™(X) is a continuous linear map and for 
all @, weC>(X) the operator 
Sour paApu 


is in Op S™, then one can find aeS"(X x IR") such that 
A=a(x,D)+Ag 


where the kernel of Ag is in C°(X xX). Here a is uniquely determined 
modulo S~™(X x IR"). 


Proof. Let 1=)'w,(y) be a locally finite partition of unity in X. Then 
YA, u=a;,(x,D)u, ueX where a,eS” and aj,(x,¢)=0 when x¢supp ;. 
Set 

a(x, =)" aj, (x, 4) 


where we sum over all j and k for which suppy, suppy,+0. The sum is 
locally finite since any compact subset of X meets only finitely many 


18.1. The Basic Calculus 85 


suppw, and they meet only finitely many suppw,. Hence aeS"(X x R"). If 
K is the kernel of A then the kernel of A —a(x, D) is the sum 


LY" Wj) KO, Wi.) 


taken over the indices for which supp, supp, =9. It is in C°(X x X) 
since the sum is locally finite and the terms are in C® by Theorem 18.1.16. 
Now if beS” and the kernel is in C?(R" x R”), then 


b(x, Jae bx, Dl 


is rapidly decreasing when €— 00 so beS~™. If A=0 we can apply this 
conclusion to b(x, D)u=da(x, D) wu taking @=W=1 near any given point 
in X. This proves that aeS~°(X x IR”) which completes the proof. 


We are now ready to define pseudo-differential operators on manifolds: 


Definition 18.1.20. A pseudo-differential operator of order m on a C” ma- 
nifold X is a continuous linear map A: Cf?(X)— C™(X) such that for every 
local coordinate patch X,cX with coordinates X, 3x ++ K(x) 
=(x,,...,x,J)EX, CIR” and all ¢, we C%(X,) the map 


SFR") 2 ur dK )*An*(pu) 


is in OpS™ We shall then write Ae¥"(X) and extend A to a 
map 6'(X)— B(x). 


_ If X CR" it follows from Proposition 18.1.19 that A must be the sum of 
an operator a(x,D) with aeS"(X xR") and an operator with kernel in 
C™(X x X). Theorem 18.1.17 shows that conversely every such operator is in 
v™(X). Definition 18.1.20 just means that the restriction of A to each coor- 
dinate patch is of the preceding form in the local coordinates. It is of course 
sufficient to know this for so many coordinate patches X, in X that the 
products X,x X, form an atlas for X x X. It suffices to use an atlas for X if 
one requires in addition that the kernel of A is smooth outside the diagonal. 
In particular, if a,eS" and the kernel of a,(x,D) has compact support in 
X,.x X,, then we can define Ae ¥"(X) by 


Au=x*a,(x,D)(k~)*u in X,, Aueé(X,). 


If A is polyhomogeneous it follows from Theorem 18.1.17 that a prin- 
cipal symbol a® is invariantly defined on T*(X)\0, where 0 denotes the 0 
section. It is obtained by just pulling the principal symbol back from 
T*(X,)\0 to T*(X,)\0. To define the principal symbol for any Ae” we 
first define S"(T*( (X)) as the set of all aeC™(T*(X)) such that the pullback 
to T*(X,)=X, xR" is in S"(X, xR") for every coordinate patch. By Lem- 
ma 18.1.18 it is enough to require this for an atlas, and the definition agrees 
with our earlier one if X JR”. If Ae" then the restriction of A to X, 
identified with X, defines a symbol in S"(X, x R"/S~*(X, x IR") by Propo- 


86 XVIII. Pseudo-Differential Operators 


sition 18.1.19. If a,eS"(T*(X,)) is the pullback of a representative then 
A, — AES" ~ 1(T#(X,. 0X,)) by Theorem 18.1.17 for every pair of coordinate 
patches. With a partition of unity {y,} subordinate to a covering by coor- 
dinate patches X,,, we set 


a= a, €S"(T*(X)) 


pK; 
and obtain a—a,¢S"~1(T*(X,)) for every x. This determines a modulo 
S™-* so we obtain a principal symbol isomorphism 


ym(X)/P™— 1 (X)=S"(T*(X)/S"—*(T*(X)). 
To prove surjectivity we take y, now with }’y7=1 and set for aeS"(T*(X)) 
Aju=w xt a,(x, D)(«Kj7*)*(Wju), ueC™(X), 


where a, is the pullback of a to T*(X x)- Then A=) A, has the principal 
symbol a. We also have an isomorphism between ¥~™(X) and the oper- 
ators with C® kernel, that is, operators mapping &’(X) into C®(X). 

To be able to compose operators freely one needs to have some infor- 
mation on the support of the distribution kernel, which is a distribution in 
XxX with values in 1 [Ki Q, that is, a distribution density in the second 
variable. 


Definition 18.1.21. The (pseudo-differential) operator A in X is said to be 
properly supported if both projections from the support of the kernel in 
XxX to X are proper maps, that is, for every compact set K cX there is a 
compact set K’< X such that 


suppuc K =>supp AucK’; u=0 at K’> Au=O at K. 


Note that A can then be extended to a map 9(X)-— Y'(X) so that the 
last property is preserved. 


Proposition 18.1.22. Every Ae" can be written as a sum A=A,+Apo where 
A,é¥" is properly supported and the kernel of A, is in C™. 


Proof. As in the proof of Proposition 18.1.19 we take a partition of unity 1 


=)'w; in X and set 
A, u=)’ WAU) 


with the sum taken over all j and k such that suppy,csuppy,+0. The 
same proof shows that A, is properly supported and that Ag=A-—A, has a 
C® kernel. 


Using the preceding decomposition and Proposition 18.1.3 it is easy to 
show that if A,e'?"/(X), m,| — oo, then one can find Ae”? with 


A—-)} Aje?™ for every k. 


i<k 


The details of the proof are left for the reader. 
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Since the asymptotic formulas in the calculus of pseudo-differential 
operators only allow one to recognize them modulo ¥~%, we shall usually 
be working with ¥"/Y~-© instead of ¥". By Proposition 18.1.22 it is then 
always possible to pick a properly supported representative for the class 
considered. It is therefore no essential restriction that we require proper 
supports in the following immediate consequence of Theorem 18.1.8. 


Theorem 18.1.23. If A,;e"/(X) are properly supported, j=1,2, then A 
=A,A,6¥™*™2(X) is properly supported and the principal symbol is the 
product of those of A, and of A,. 


Proof. Let ¢,yeC§(Y) where Y is a coordinate patch, and choose yeC?(Y) 
equal to 1 in a neighborhood of supp ¥. Then 


PA, A,W=(PA, DHAWtTOA(L-P)A2Y. 


The first term on the right is in ¥" by Theorem 18.1.8 and the other has a 
C® kernel. 


The proof that (iii) = (i), (ii) in Theorem18.1.9 gives with no change an 
extension of Theorem 7.1.22: 


Theorem 18.1.24. If Ae” is properly supported and elliptic in the sense that 
the principal symbol aeS™(T*(X)/S"—'(T*(X)) has an inverse in 
S-™(T*(X)/S~"~ '(T*(X)) then one can find Be ¥-™ properly supported such 
that 

BA-Ie¥-”~, AB-—Ie¥-™., 


One calls B a parametrix for A. ° 


In Chapter XIX we shall discuss the existence theory for elliptic oper- 
ators which follows from the existence of a parametrix. Here we proceed to 
discuss local versions of Theorem 18.1.24. 


Definition 18.1.25. If aeS"(T*(X)) is a principal symbol of Ae” then A is 
said to be non-characteristic at (x9,¢))€T*(X) 0 if ab—1eS~' in a conic 
neighborhood of (x9, ) for some beS—”. The set of characteristic points is 
denoted by Char A. 


The definition is of course independent of the choice of a. The proof of 
the equivalence of (iii) and (iv) in Theorem 18.1.9 shows that in terms of 
local coordinates an equivalent condition is that |a(x,|2c{é|" for large |é| 
in a conic neighborhood of (x9,é,). If A has a homogeneous principal 
symbol a, the condition is equivalent to a(x,,¢9)+0 so our present defini- 
tion of Char A coincides with (8.3.4) for differential operators. 

If AeW" and k<m we shall say that A is in P*, or of order k, at 
(X9,€é,)€T*(X)\0 if for the complete symbol a(x,é) of A restricted to a 
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coordinate patch containing x, we have aeS* in a conic neighborhood of 
(X9,&). By Proposition 18.1.19 and Theorem 18.1.17 this condition is inde- 
pendent of the choice of a and of the local coordinates. The case k= — oo is 
of particular importance: 


Proposition 18.1.26. If Ae¥"(X) and I is a closed conic subset of T*(X)~0, 
the following conditions are equivalent, of denoting the kernel of A: 
(i) A is of order —oo in T*(X)\ ONT. 
(Gi) WF'8) C {M7 € TY. 
(iii) WF(Au) CIN WF(u),u € &(X). 


Proof. The statements are local so we may assume that X CIR” and that 
A=a(x,D), where aeS” and a(x, €)=0, |é|<1. If a is rapidly decreasing in a 
conic neighborhood V of (xo,é9) we can choose géC®(IR” x (IR" \.0)) with- 
support in V so that q(x, €) is homogeneous in & of degree 0 and equal to 1 
in a neighborhood of (xy,&,). Then the kernel of (aq)(x,D) is in C®, and 
(X9,€9,Xo, —&) iS not in the wave front set of the kernel of (a(1—q))(x, D) 
by Theorem 8.1.9. In view of (18.1.24) it follows that (i) = (ii). That (ii) = (iii) 
is a consequence of Theorem 8.2.13. Finally assume that (iii) is valid, and let 
(x9, &9)¢I'. Choose geS° with support in a closed cone F, with F nr =@ and 
equal to 1 at oo in a conic neighborhood of (x9, &). Then WF(q(x, D)u) <I, 
for every ueé’ since (i)= (iii), so WF(a(x, D)q(x, D)u)c I Al, =9. Thus 
a(x, D)q(x,D) has a C® kernel K. Choose ¢eC? equal to 1 in a neigh- 
borhood of x, and set a(x, D) q(x, D) ¢6u=b(x, D)u. By (18.1.27) we have 


b(x, =e" | K(x, y) Py el dy 
and the right-hand side is rapidly decreasing when €-— 00. The same is true 
for the derivatives so beS~™ in a neighborhood of x9. But b—aeS~® in a 
conic neighborhood of (x9,¢,) so it follows that aeS~” in a conic neigh- 
borhood of (x9, €5). Thus (iii) = (i) which completes the proof. 
Since WF’(.x/) is contained in the diagonal of T*(X)\0 x T*(X)\0 it is 
natural to identify it with a conic subset of T*(X)~0. We shall write 


(18.1.34) WF(A)={yeT*(X) 90, (7, YeEWF (aD)}. 


By Proposition 18.1.26 WF(A) is the smallest conic set such that A is of 
order — oo in the complement, we have 


(18.1.35) WF(Au)cWF(A)AWF(u), ueé’(X), 
and no smaller set than WF(A) can be used in the right-hand side. It is 
clear that WF(AB)<WF(A)OWF(B) if Ae P(X), Be" (X). 

We can now state a microlocal version of Theorem 18.1.24: 


Theorem 18.1.24’. If Ae¥™ is properly supported and (x9, ))¢Char A then 
one can find a properly supported Be'¥~™ such that (X9,€).)E WF(BA—I) and 
(Xo, 69) WF(AB—J); these conditions are equivalent. 
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Proof. By Definition 18.1.25 we can choose B,e¥~™ properly supported 
(like all operators in what follows) so that AB,—JI is of order —1 at (Xo, &o). 
This means that AB, =1+R,+R, where R,eP~' and (xo,6.)¢ WF(R,). By 
Theorem 18.1.24 we can find B,¢¥° with (1+R,)B,—-Ie¥-*. Since 
(xo, €,)¢WF(R, B,) it follows that (xo, é))}¢WF(AB—J) if B=B, B,. Similar- 
ly we find B’ with (xo, é,) WF(B’ A—I). Since 


B'~B=(B' A—I)B—B'(AB—1) 


it follows that (x9,é))EWF(B’-B). Hence (xo,é,)¢WF(BA—J), and the 
proof is complete. 


Theorem 18.1.24’ allows us to give an alternative description of the wave 
front set of a distribution. 


Theorem 18.1.27. If ueQ'(X) we have for every meR 
(18.1.36) WF(u)=() Char A 


where the intersection is taken over all properly supported Ae'¥"(X) such that 
AueC™(X). 


Proof. Assume that (x9,¢))¢WF(u). Then we can choose Ae¥” with 
WF(A)QXWF(u)=0 and (x 9,é,)¢Char A, by just working in a coordinate 
patch containing xy. This proves that (\ Char Ac WF(u). On the other 
hand, assume that de¥", AueC® and that (x9, ¢é,)¢Char A. We must then 
prove that (x9,€))¢WF(u). Choose Be ¥~™ using Theorem 18.1.24’ so that 
(Xo, €o)¢ WF(BA —J). Then 

u=BAu+(I—BA)u 


where BAueC®™ and (X9,é))¢WF(I—BA)u) by Proposition 18.1.26. The 
proof is complete. 


From now on we shall always fall back on (18.1.36) as our definition of 
WF(u). With this definition the analogue of Theorem 8.3.1 is obvious: 
Theorem 18.1.28. If Ae ¥"(X) is properly supported and ueQ'(X) then 
(18.1.37) WF(u)c WF(Au)UChar A. 

Proof. If (x, )¢ WF(Au) we can find Be ¥° with BAueC® and (x, &)¢Char B. 
If (x, &)€Char A then (x, &)¢éChar BA so (x, E)EWF(u). | 


Remark. If F is a closed cone < T*(X)\0 we introduced in Section 8.2 the 
space 
BAX) = {ueD'(X), WE(u) cI}. 


In Definition 8.2.2 we introduced a notion of convergence for sequences in 
OX). The preceding arguments show with little change that uj,>u in 
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9X) is equivalent to uj>u in Y(X) and Au;>Au in C™(X) for every 
properly supported A with [A WF(A)=@. We leave the details for the 
reader. 


In appendix B we have defined the space H(f(X) of distributions u on X 
such that (x~')*( u)€H,,(IR") for every local coordinate system x: X 4 “ 
<IR" and every PeCp(X,). The main point was the invariance of H(™?(X,) 
under changes of variables proved in Theorem B.1.8. This is also an im- 
mediate consequence of Theorem 18.1.13 and Theorem 18.1.17. If AeW™" we 
have ApueC*(X \supp ¢), and AgueHl&,,(X,) by Theorem 18.1.13, if 


ueHi(X ), so we obtain, using Theorem 18.1.24 to prove the converse, 


Theorem 18.1.29. If ueH Sy (X ) (resp. Hi3"?(X)) and Ae™ is properly sup- 
ported then AueH'®*_(X) (resp. AueHen?,(X)). The converse is true if A is 


(s—m) —m 
elliptic. 


Thus one can define Hi%f(X) as the set of all weQ’(X) mapped to L7,,(X) 


by every (some elliptic) properly supported operator in ¥*(X). The preced- 
ing discussion can be localized with the following terminology: 


Definition 18.1.30. If ue'(X) then ueH(f at x oeX if u=u,+ug with 
u,e€Hi(X) and u,éC® in a neighborhood of xo. If (xo,é,)€T*(X)N0 we 
say that weHS at (Xo,6o) if u=u,+uy with u,eHy(X) and 
(Xo, Eo)¢ WF (up). 


It is obvious that ueH(gy at x, if and only if @ueH{(X) for some 
peC™(X) with $(xo)+0. The condition ueH{g at (xo, 9) can be expressed 
in a similar way with pseudo-differential operators replacing cutoff func- 
tions: 


Theorem 18.1.31. If ueQ'(X) and Ae'¥"(X) is properly supported then 
(18.1.38) UuEH{Y at (Xo,bo) > AUCH. » at (Xo, So): 


One can choose A so that AueH(?< ,,(X) and (x9,&9)¢Char A. On the other 
hand, 


(18.1.39) AUCH. m At (Xo,Fo) and (X,E>)¢Char A 
=> ueH!'®® at (xo,Eo). 


If ueH{} at (Xo, o) for every E5¢T*~0 then ucHy at Xo. 


s 


Proof. If u=u,+u, and u,eHy(X) then Au,e€H(,)(X) by Theo- 
rem18.1.29, and if (x9,é))¢WF(u,) then (X9,é9)}¢WF(Au,). This proves 
(18.1.38) and also that AueHi,,(X) if WF(uy)A\WF(A)=9. If 
(Xo, €,)€Char A we choose Be ¥~"(X) according to Theorem 18.1.24’ and 


obtain if Aue He) at (Xo, €) that BAueH{S at (xo, Eo), and WF(u—BAu) 
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< WF(I—BA) which does not contain (x9, &,). Hence (18.1.39) follows. As- 
sume now that ue Hier at (xo; ¢) for every e7,* ~0. Since the unit sphere in 
Tx is compact and Char A is closed, we can choose finitely many A,e ¥°(X) 


such that A. jueHes(X) and 
T,X AChar A, ... \Char A, =6. 


Let A, .€W¥° have a principal symbol which is complex conjugate to that of 
A, and set A= vA; A;. Then Ae¥® and A is elliptic in a neighborhood of 
Xo. Using Theorem 18. ce 24 we can choose Be ¥° such that BA —I is of order 
—oo in a neighborhood of x). Hence BAu—ueéC™® in a neighborhood of xo, 
and since BAueH(;(X) it follows that ueH(y in a neighborhood of xp. 


Remark. The discussion above is equally valid for ’H,, for any p. 


Occasionally it is useful to introduce functions in X or in T*(X)\0 
which measure the smoothness of u by means of H_,,, spaces, 


(18.1.40) s,(x)=sup {s; ue Hi at x} 
(18.1.41) s*(x, €)=sup {s; ueH(y at (x, 6}. 
These are obviously lower semi-continuous functions with s,(x) S sx(x, €). If 


S<5i(x,€) for every € € TT \ Othenu € AS at (x, £) for every £,sou € HS at 
x, hence 


(18.1.42) s, (x)= inf s* (x, €). 
é 


By Theorem 18.1.31 we have s4,(x,€)2s*(x,é)—m if Ae", and there is 
equality if (x, ¢Char A. 


We have postponed until now the discussion of adjoints of pseudo- 
differential operators. The reason is that the dual objects of functions are 
densities, as we saw in Section6.3. The adjoint of a pseudo-differential 
operator is therefore a pseudo-differential operator from densities to densi- 
ties, unless a positive density is distinguished which allows identification of 
functions and densities. We must therefore make some comments on 
pseudo-differential operators between sections of vector bundles E and F 
over X. This will also be important in Chapters XIX and XX. 


Definition 18.1.32. Let E and F be complex C® vector bundles over the C” 
manifold X. Then a pseudo-differential operator of order m from sections of 
E to sections of F is a continuous linear map 


A: C3(X, E)> C™(X, F) 
such that for every open YcX where E and F are trivialized by 


op: Ely>YxC*, op: Fly > YxC/, 
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there is a f xe matrix of pseudo-differential operators A;,¢'P"(Y) such that 


(p-(A wly)i= > Ai(Oe4);, ueCo(Y, E). 
We shall then write Ae ¥"(X; E, F). 


Naturally it suffices to assume that there is a covering of XxX by 
coordinate patches YxY such that A,, can be expressed as an operator in 
OpS™ modulo C® in the local coordinates. We leave as an exercise for the 
reader to verify that the principal symbol of A is well defined as an element 
in 

S"(T*(X); Hom(E, F))/S"~*(T*(X); Hom(E, F)) 


where Hom(E, F) is the vector bundle with fiber at (x, é) consisting of the 
linear maps from E, to F,. (See also Section 6.4 for the case of differential 
operators.) We also leave for the reader to convince himself that spaces 
Hi (X,E) of sections of E can be defined as in the scalar case and that 
Ae®"(X; E, F) implies that A: HQi(X, E) > H&°_,)(X, F) is continuous if A is 
properly supported. The obvious extension of Theorem 18.1.23 to operators 
between sections of bundles is also left for the reader. 

In Section 6.4 we defined the density bundie Q on X: a section of Q 
expressed in local coordinates x,,...,x, is a function u such that the mea- 
sure u|dx| is independent of how they are chosen, |dx| denoting the Lebes- 
gue measure in the local coordinates. For the representation u’ in the local 
coordinates x’ we therefore have 


u' |dx'|=u|dx|. 


We can define the powers 9% of Q for any aeC€ by just changing the 
transformation law to 


u’ |dx’|°=u|dx|* 
or, more formally, we take the transition functions 
Sax =ldet(koK’1Y fox’ in X, NX, 


if x and «’ are arbitrary local coordinates with coordinate patches X, and 
X,. We shall now work out the transformation law for the second term in 
the symbol of a polyhomogeneous operator acting on half densities, that is, 
sections of Q?. This means that (18.1.29) must be replaced by 


(18.1.29) ((a, (x, D) u)ox) |J|* = a(x, D)((uck) |J|*) 
where J =det x’(x). Now 


|J|~* a(x, D)(v|J 7) =b(x, D)v 
where 
b(x, €)~ [J] * a™ (x, €) D* |J|#/a! 
=a(x,€)+4¥ a(x, €) D, J/J mod S™~?, 
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The sum cancels that in (18.1.33). For the subprincipal symbol we now have 
the simple transformation law 


(18.1.33) ax (K(x), n)=a"s(x,"«'(x) 9) 


so it is an invariantly defined function on T*(X)~0. The same calculation 
can be applied to any symbol in SS". If we repeat the argument in the 
definition of the principal symbol we obtain 


Theorem 18.1.33. If Ae¥"(X ; Q?,Q*) then A has a refined principal symbol 
a(A)eS"(T*(X))/S"-2(T*(X)) such that if A is defined by a(x,D) in a local 
coordinate system then 

a(A)—a—i/2) aes" ?. 


When A is polyhomogeneous this means that the subprincipal symbol (18.1.32) 
is invariantly defined on T*(X)~0. 


The product of two half densities is a density so the (anti-)dual space of 
C%(X,Q?) is D’(X,Q?). Hence the adjoint of a continuous linear operator 
C®(X, 24) + C%(X, Q*) is a map &'(X, Q*) > OX, Q#). From Theorem 18.1.7 
we now obtain: 


Theorem 18.1.34. Every Ae¥"(X; Q?, Q*) has an adjoint A*e¥"(X ; Q4,Q), 
thus 
(Au,v)=(u, A*v); uu, vECS(X, Q?). 


If ais a (refined) principal symbol for A, then @ is one for A*. 


The (anti-)dual of a complex vector bundle E over X is defined so that 
the fibers are the (anti-)duals of those of E. Thus the transition matrices g;; 
of E are replaced by ‘g;;' for the dual and g*~* for the anti-dual E*. We 
also define E @ Q? as the vector bundle with transition matrices obtained by 
multiplying those of E and of Q#, the latter being scalars. If 
uEeC™(X, E@ Q?) and veC™(X, E* @ 2%), then (u(x), v(x))e C°(X, Q) and can 
be integrated over X if the support is compact. The following is an obvious 
extension of Theorem 18.1.34: 


Theorem 18.1.34’. If FE, F are complex vector bundles, then every 
AeW¥™(X; E@ Q3, F@Q4) has an adjoint A*e'P"(X ; F* @ Q#, E* @® MQ), 
(Au,v)=(u,A*v); ueC2?(X,E@Q4), veCR(X, F*@0). 
If ais a principal symbol for A then a* is one for A*. 
Another advantage of always having a half density bundle factored out 
is seen in the form that the Schwartz kernel theorem (Theorem 5.2.1) takes 


for manifolds: Every continuous linear map C9 (X ,E@Q3)> FY, F @Q3) 
has a kernele9’(Y x X, Hom(E, F)@ Q}, x) where (Hom(E, F)),,, is the space 
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of linear maps from E, to F,. The verification is obvious once one satisfies 
oneself that the converse is true. It was only to avoid being involved in such 
discussions that we stated Theorem 8.2.12 and the following results in Sec- 
tion 8.2 only for open subsets of IR”. 


The results proved in this section are valid for more general symbol 
spaces. In particular one can use some of the spaces S®, obtained when 
(18.1.1) is replaced by 


(18.1.1)" lax QS, p(L+eyr Perel; x, SelR". 


Here 0<p<l and 0<6<1. These spaces were already introduced in Sec- 
tion 7.8. The basic reason for their interest is that by Theorem 11.1.3 we 
have estimates of the form (18.1.1)” for 1/P(é) if P is hypoelliptic and |€| is 
large. Since S¥ ,> St. 9=S™", Proposition 18.1.3 and 18.1.4 are valid with no 
change, and so is Theorem 18.1.6. The asymptotic series in (18.1.11) is only 
defined when 6<p, but Theorem 18.1.7 is valid when d<p apart from the 
breakdown of (18.1.11) when d=p. The same is true for Theorem 18.1.8 
whereas in Theorem 18.1.9 we must assume that <p. Theorems 18.1.11 and 
18.1.13 are valid when 6<p but the proofs given are only applicable when 
6<p. In Theorem 18.1.14 the hypothesis should be replaced by aeS?%*?~?, 
Changing variables requires an additional condition, for Lemma 1811. 18 is 
only valid when 1—p<6, that is, p2=1~—65. When 5<p this implies p24 

with equality only when 5=4 also. Now Theorem18.1.17 remains valid 
when 1—p<6<p except for the asymptotic expansion which we only have 
when d<p. The rest of the section is really just formal and requires no 
change. The notation ¥",(X; E,F) is used for the pseudo-differential oper- 
ators based on Si. 

We shall not Carry out the proofs of the preceding statements. They may 
be supplied by a reader wanting to consolidate his grasp of the material in 
this chapter. Alternatively, the classes S% , may be regarded as very special 
cases of the general classes of pseudo-differential operators discussed later 
on in this chapter. (See the end of Section 18.4.) However, we shall prove a 
technically useful result concerning products of pseudo-differential operators 
in IR" and pseudo diflerential operators in x’=(x,,... JeIR"~! depend- 
ing on the parameter x, 


Theorem 18.1.35. Let acS"(R" x R"), bes™ (IR" x R"~'), and assume that for 
some &>0 we have 


(18.1.43) a(x, C)=0 if e€,|>1 and |¢'|Sel¢,]. 


Then a(x, D)b(x, D’) and b(x, D')a(x, D) are in Op(S"*™), and the asymptotic 
expansion of the symbols can be obtained from (18.1.15). 


Xn 


Note that in the estimate 


[DE DEb(x, ESC, (1 +16" 
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we can replace |¢’| by |¢| in the set where ¢/€,|</C'|, for 
1+|[O]sitle|s( tle + 1/e). 


The asymptotic series (18.1.15) for the compositions are thus well defined. 


Proof of Theorem 18.1.35. Choose yeC™(IR") homogeneous of degree 0 out- 
side a compact set so that |e€,|>1 and |é’|<|je€,| if €esuppy, and x(Q)=1 
when {e€,/22 and 2/é’|S\eé,|. Then 
bi (x, 2) =b(x, &)(1 —x(2)es™ 
and b,=b in a neighborhood of suppa. Hence b,(x, D)a(x,D)eOp S"*” 
and the asymptotic expansion of the symbol is given by the usual formula 
for b(x, D) a(x, D). By (18.1.43) we have y(D) a(x, D)=c(x,D) where ceS~*. 
Now 
b(x, D') c(x, D) ef = b(x, De!” e(x, €) 
=e!*nén h(x, D')c(x, D', €,) eb OF? 
=e r(x, 6) 


where r(x, D’, €,)=b(x, D’))c(x, D’,é,) is a composition of pseudo-differential 
operators in n—1 variables containing x, and ¢, as parameters. Since 
Di, b(x’,x,,¢) is uniformly bounded in S™(IR"~! x IR*~*) when x,eIR, for 
any j, and (1+|é,|)* Di, DE c(x',x,,&,€,) is also uniformly bounded in 
S~%(R"-! x IR"-') when (x,,é,)€IR?, for any j,k and N, it follows that 
reS~ °UR" x IR"). We have 


b(x, D)c(x, D)u=r(x,D)u,  ueS’, 
since both sides are continuous in ¥’ and the equality holds for exponen- 
tials. Hence 
b(x, D’) a(x, D)=b (x, D) a(x, D)+r(x, D)eOp S"™*” 
and the symbol has the usual asymptotic expansion. Thus b(x, D’)* a(x, D)* 


belongs to Op S"*”", which implies that a(x, D)b(x, D'})eOpS™*™; the sym- 
bol has the usual asymptotic expansion. 


The kernel of b(x, D’} is equal to d(x, —y,) K(x, y’) where K is defined by 
the oscillatory integral 


K(x, y= (2m) fF? B(x, ode 


Even if b is of order — oo so that KEC”™ we can only be sure that the wave 
front set is in the conormal bundle of {(x, y); x,=y,} (Theorem 8.1.5). How- 
ever, these are always the only singularities which can occur besides those 
for pseudo-differential operators: 


Theorem 18.1.36. If beS"(IR" x IR"~') is of order — © outside the closed cone 
'cR"x(IR"-! 0), then the wave front set of the kernel of b(x,D') is 
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contained in the union of {(x,x,&,—€&); (x,@)eI} and {(x,y,€—&); x,=Vps 
€’=0}. Thus we have for ue€’ 


(18.1.44) WF(b(x, D’)u) < {(x, He WF\u); (x, JEP} 
U{(x,0,€,)5 (7 X,.0,€,)€ WF (u) for some y’}. 
On the other hand, 
(18.1.45) WF(u) < WF(b(x, D’)u) 
U {(x, &); € =0 or (x, &eChar b}. 


Proof. By Theorems 8.1.9 and 8.1.5 
WF(K)c {(x, ys g, —€); x’ =y, €,=0, (x, é)eT}, 
WE(8(X,—Yn)) = {00.956 —6)5 Xp =Yns C=O}. 


The statement on the wave front set of the kernel of b(x, D’) is therefore a 
consequence of Theorem 8.2.10, and (18.1.44) follows from Theorem 8.2.13. 
To prove (18.1.45) we choose aeS°(IR" x IR") with support in a compactly 
generated cone which does not intersect the right-hand side of (18.1.45). 
Then a(x, D)b(x,D)ueC® by Proposition 18.1.26, and a(x, D)b(x,D’) 1s a 
pseudo-differential operator which is non-characteristic where a is. Hence 
WF(u) <Char a, which proves (18.1.45). 


18.2. Conormal Distributions 


By (18.1.7) the Schwartz kernel of an operator in OpS” is an oscillatory 
integral of the form 


(18.2.1) K(x, y=(2n)-" fel" a(x, dé; x, yER"; 


where aeS"(IR” x R"). It is singular only at the diagonal in IR” x IR" where 
the wave front set is contained in the conormal bundle (Theorem 18.1.16). In 
this section we shall discuss a corresponding class of distributions associated 
with an arbitrary C® submanifold Y of a manifold X, starting with the case 
of a vector space and a linear subspace. To see where we should aim we 
introduce in (18.2.1) new variables x’=x—y, x’ =x, so that the diagonal is 
defined by x’=0, and obtain the distribution 


(18.2.2) (22) fel a(x”, Ed€. 


Another choice of x” variable, say x’’=y would have made a dependent on 
both x’ and x” though. We change notation now so that IR?" becomes R"” 
with the variables x =(x,,...,X,) split into two groups x’=(x,,...,x,) and x” 
=(X,44,--->X,). First we show that it does not really matter if one allows a 
in (18.2.2) to depend on x’ or not. In doing so we also note that in 
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Definition 18.1.1 and the following statements of the properties of symbols it 
is of course irrelevant that there are as many x variables as é variables. 


Lemma 18.2.1. If aeS"(IR” x R*) and u is defined by the oscillatory integral 


(18.2.3) u(x)=f ef 8? a(x, E) de’ 

then we also have 

(18.2.3) u(x)=f ef 8? a(x", E)de&’ 

where GeS™(IR"—* x IR") is defined by 

(18.2.4) G(x", =e HPP a(x, Evo 
and has the asymptotic expansion 

(18.2.5) A(x", O)~}) < iD, Dey al%, EV "yo: 


Proof. Assume first that aeX% Then ue ¥ and (18.2.3) means that (27)* a is 
the Fourier transform of u with respect to x’, 


G(x”, E)=(20)~* ff ef 9-5 a(x, 0) dO dx’ 


— e—i<Dx, De’) a(x, 2) x= 0° 


The last equality follows from the discussion preceding Theorem 18.1.7. 
Now the map . 
ar e~i<Pxs De) a(x, ') 


is continuous in “’ and maps bounded sets in S” to bounded sets; this 
follows from Theorem 18.1.7 since the presence of the parameters x” is 
obviously immaterial. For a general aeS” we can take a sequence ayeS 
which is bounded in S” and converges to a in ’, that is, uniformly on 
every compact set, and conclude that (18.2.3) is always valid with @ defined 
by (18.2.4). The asymptotic expansion also follows from Theorem 18.1.7. 


We shall now determine the precise regularity properties of distributions 
of the form (18.2.3) with aeS”. Noting that a density on the subspace 
defined by x’=0 can be written in the form (18.2.3), with a independent of €’, 
we first prove an extension of Theorem 7.1.28. 


Proposition 18.2.2. Let ¢€C,(IR") be equal to 0 in a neighborhood of 0, let 
a(x", )eS™(R"—* x IR") vanish for x’ outside a compact set, and let u be 
defined by the oscillatory integral 


(18.2.6) a(xj=fe**? alx’, E)de’. 
Then tie? 


loc 


(18.2.7) Sla(Q)i7 IP(E/RdESCRP™, ROI. 


and 
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If aeS®,,(IR"~* x IR‘) and ay is the principal symbol, m'=Rem, then 
(18.2.8) lim Ro*~?™(2n)-"~* f [a(S)|? H{(E/R) dé 


= [flag(x”, 2)? O(€, 0) dx! de’. 


Proof. Let @ be the Fourier transform of a with respect to x”. Then a(€) 
= (2 2)* a(é", 2’), so 
J 1a(E)I? (E/R) dg = (2 0)?" f 14(E", EY? G(E/R) dE 
=(2n)?* Rf |a(e", REN? O(2', 2"/R) dé. 
Since |x”| is bounded in the support of a we have for every N 
aE", ENS Cyl +16") +161)”. 


We can write 6=¢,+, where $,e€C,(R") and |¢'|2c in supp ¢,, |¢"|2¢ 
in supp @,, for some c>0. When ¢ is replaced by |¢,| we have |€”|>cR in 
the support of the integrand and the integral is rapidly decreasing as R — oo. 
When ¢ is replaced by |@,| then |¢’|=c in the support of the integrand, and 
(18.2.7) follows. If a is polyhomogeneous then 


R-2™ (ial, RE) (6, C'/R) aE 
=fla(e”,REYR"l? O(E,6"/R)de 
> fldg(€", C1? (6, 0) dé 
=(2n)"* [flag (x”, 2)? @(€,0)dx" dé’ 


by dominated convergence and Parseval’s formula. This proves (18.2.8). 
The singularities of u lie in the plane x’=0: 


Proposition 18.2.3. If the hypotheses of Proposition18.2.2 are fulfilled and 
ye Ce (IR") is equal to 1 in a neighborhood of 0, then (1 —y(x’))ue A(R"). 


Proof. If «=(a,,...,%, 0,...,0) then 
x u(x)=fel"#(—Dp Pale”, Ede 


is a bounded continuous function if m—|a|<—k so that the integral is 
absolutely convergent. If m—|a|<—k—v then the derivatives of order <v 
are also bounded and continuous. This proves the statement, for 
(1 —z(x’)) |x’|-?% has bounded derivatives of all orders for any N. 


From Proposition 18.2.3 we see that the behavior of # at infinity exam- 
ined in Proposition 18.2.2 depends only on the regularity properties of u 
when x’=0. Using Definition B.1.1 of the Besov spaces °H,,) we can state 
(18.2.7) in the form 


(18.2.7)' ue? myn (R") if aeS”, 
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(Note in particular that if u is the kernel of an operator in OpS” in n/2 
variables, then ue°H_»_n/4):) The regularity property (18.2.7) is preserved 
under suitable differentiations of u. In fact, if u is defined by (18.2.3) then 


Dyu(x)=fei8? D, a(x, E)de; jf >k; 
x, D,,u(x)=fel"8?(x,D,,—Dz Eales e)des ijk; 
so these operations preserve the form (18.2.3) and the order of the symbol a. 
(The order of the factors x and D is not important since x,D,;—Djx,=id;.) 
We shall now prove a converse: 
Lemma 18.2.4. Let we &'(IR") and assume that 
x* DP ue? Hy m—4i2) 

for all « and B with |a'|=|B’|. Then u is of the form (18.2.3) with an amplitude 
Ge S™(IR"=* x R*), 
Proof. By hypothesis the Fourier transform @ is in C® and 


J | Dae? désC,,R'"**, R21, |w'l|ZI6', 


R/2<|é|<2R 


for the order of x* and D* is irrelevant, as just pointed out. Taking |f’|=0 
and |B’|>N-+m+k/2 we conclude that for any N 


7 | Del? dns Ca,wlel-™™ if O12 1c". 
ie 


By Lemma 7.6.3 for example it follows that |D*a(2|S Cy, wlEl7%, if [2"|>1e. 
To deal with the opposite case we introduce 


Up(d) = a(R C7, ”)/R™ 
and observe that 
J |G? D* UR(E)? dESC,g, lo’ =16'| 


if Ep is the ellipsoidal annulus defined by 4<|¢'|?+{&”/R|?<4. Since the 
maximum of |2’*'| when |f’|=|a’| is bounded from below in a neighborhood 
of the unit sphere in IR“, we obtain for all « and N using Lemma7.6.3 


ID? UREISC WAHEED), lel=t, (Ee <R. 
Returning to the original scales we have proved the estimate 
D*a(EWS CY (LHS + ler 
for all €. Hence 
A(x”, E)=(Qn)-" f a(S eb? de” 


is in S"(IR"~* x IR*) as claimed. 
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The vector fields considered in Lemma 18.2.4 are tangential to the plane 
defined by x’=0, and they generate all such vector fields: 


Lemma 18.2.5. Any C® vector field in IR" which is tangential to the subspace 
defined by x'=0 can be written in the form 


XY ail) *0/0x;+ Y, a(x) 0/0x;, 
k k+1 


L378 


where a,, and a; are in C™. 


Proof. Let ¥ ax) d/ox, be a C® vector field which is tangent to the 
subspace defined by x'=0. This means that a,(0,x”)=0 when j<k. Hence 
Theorem 1.1.9 (with parameters) gives that for some a,,eC™ 

ax)= XY a; (x) x; isk, 


isk 


which proves the lemma. 


Let X be an open set in R" and let ueD’(X) be defined by (18.2.3) where 
aeS™(X x R*). Then it follows from Lemma 18.2.5 and the remarks preced- 
ing Lemma 18.2.4 that L, ... Lyu€° Ho 42)(X) if Ly,...,Ly are any num- 
ber of first order differential operators tangential to the plane x’=0. The 
converse follows from Lemma 18.2.4. In fact, if yj;eCo(X) have locally finite 
supports and )) w7=1, then Lemma 18.2.4 gives 


julxpa feb? G(x’, E)de" 


where d,eS"(IR"~* x IR“), and (18.2.3) follows with a=) w,4;. Thus we are 
led to the following definition. 


Definition 18.2.6. Let X be a C® manifold, E a C® (complex) vector bundle 
over X and Y a closed C® submanifold of X. Then the space I"(X, Y; E) of 
distribution sections of E, conormal with respect to Y and of degree m, is 
defined as the set of all ueQ’(X, E) such that 


(18.2.9) Ly... Lyue?H', nja(X BE), n=dimX, 


for all N and all first order differential operators L,; between distribution 
sections of E with C® coefficients tangential to Y. The topology is the 
weakest one which makes the maps ur+L, ... Lyue°H(°,,_ 9/4) Continuous. 


Recall that the principal symbol of L, is a linear function on the fibers 
T,;*(X) with values in the linear transformations in E,. The condition in 
Definition 18.2.6 is that it vanishes on the conormals of Y. The normaliza- 
tion in (18.2.9) has been chosen so that the kernel of a pseudo-differential 
operator in ¥"(X; Q? @ E,Q? @ F) is in I™(X x X, 4; Q? @ Hom(E, F)), where 
A is the diagonal in X x X. 
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It is sufficient to take the operators L, in (18.2.9) among a system of 
generators, that is, a set M,,...,.M, of first order differential operators 
tangential to Y such that every tangential operator is of the form 


(18.2.10) L=)a,Mj+a, 


where a,eC°(X, Hom(E, E)). In fact, we can replace Ly by such a sum in 
(18.2.9). This gives a sum with one factor L, less if we replace Ly_, by the 
product Ly_,a,;. Continuing in this way we obtain at last a sum of terms of 
the form aM,,...M;,,u with aeC*(X,Hom(E, £)). In particular, we can 
always take all M, with principal symbols proportional to the identity. 


Theorem 18.2.7. If ueI"(X, Y; E) and Aew™ (X ;E,F) is properly supported, 
then AueI™*" (X, Y; F). 


Proof. Let L, be first order differential operators between sections of F 
which are tangential to Y and have principal symbols proportional to the 
identity. We have to show that 


L,...Ly Aue®H' 


(-—m—m’ 


_njay(X> F). 


If N=0 this follows from the continuity of A from “H(%,,_,4) to 
He mm —nja): If N>O we choose a first order operator Ly on sections of E 
whose principal symbol is the same multiple of the identity as that of Ly. 
Then 


LyA=ALy+Ao 


where A,e¥™(X;E,F). Since Lyuel"(X,Y;E) the proof is reduced to a 
smaller value of N and therefore it follows by induction. 


We can now show that I"(X, Y;E) does in fact consist of the distri- 
butions which are locally of the form we first set out to study. 


Theorem 18.2.8. ucI"(X, Y; E) if and only if y,uel"(X, Y; E) for every wy, in a 
partition of unity on X. If X is an open set in IR” and Y is defined by x’ 
=(x,,...,%,)=0 while E=X x C%, then any ueI™"(X, Y; E) with compact sup- 
port is of the form 


(18.2.11) u(x)=f 8? a(x”, E) de 


where ae S™*"—2/4(R"—-* x R* C9). Conversely, every u of this form is in 
I"(X, Y:E). 


Proof. The first statement is an immediate consequence of Theorem 18.2.7. 
The second one follows from Lemma 18.2.4, and the final statement is a 
consequence of Proposition 18.2.2 and Lemma 18.2.5. The proof is complete. 
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Next we shall introduce a principal symbol for elements in I(X, Y; E). 
In doing so we start with the simplest case where E=?, the half density 
bundle on X, and we take X CIR”. If u has compact support and is of the 
form (18.2.11) with aeSmt-2"/4(R"—* x R*) and principal symbol a, then 
(18.2.8) gives if m’=Rem 


ree) lim Ro?" "(2m)" [}(E))? G(E/R) dE 
=(2n}' flag(x", E91? (6, 0)dx" dé. 


Since wu is a half density in IR” it follows from Parseval’s formula that @ 
transforms as a half density under linear changes of variables. The left-hand 
side is therefore invariant under such changes of variables. Now the normal 
bundle of the plane x’ =0 is parametrized by 


(18.2.12) 0", €) > (0,x”,€',0) 


so it is natural to expect a,(x”, é') to define invariantly a half density there, 
making the right-hand side of (18.2.8) also invariant. Since the codimen- 
sion k does not occur in the left-hand side we want to make it disappear in 
the right-hand side by including a factor (2)? in the principal symbol, or 
rather inserting a factor (22)~"/? in (18.2.11). If u is the kernel of a pseudo- 
differential operator of order m in n/2 variables this does not quite give the 
customary factor (22)~" in (18.1.7) so we take an additional factor (22)~"/* 
to get agreement. Thus we change (18.2.11) to 


(18.2.11) u(x) =(2m)— +214 fei ax”, EN de, 
qegmta—2b/4 
We want to show that the half density 
a(x”, &')|dx"|* |de"|* 
which this defines on the normal bundle of the plane x’=0 with the 


parametrization (18.2.12) is invariantly defined modulo symbols one degree 
lower. 


Theorem 18.2.9. Let X and X,, be open subsets of IR" and let x: X >X,, be a 
diffeomorphism preserving the plane Y={x;x' =O}. Let u,e8&(X,) and let u 
=|detx’|*x*u,e8(X) be the pullback to X as a half density. If 
u,el™(X,, Y), 


(i 8.2.1 1)’ u,.(x) = (2 m)— "+ 2ky/4 f eid a, (x”, é’) dé’, 
then ueI™(X, Y) is of the form (18.2.11) with 
(18.2.13) a(x”, 2’)—a,(«2(0, x"), "ky (0, x")? 6) det x, (0, x”1-# 


- [det x,,(0, x”)PeS™*— 204-1, 
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HG 
ree C 1 i) 
Koi Ko2 
are the splittings of x and of x’ corresponding to the splitting of the variables; 
we have «,,(0,x”)=0 since x,(0,x")=0. 


Here k=(k,,k,) and 


Proof. Since x , (0, x’)=0 it follows from Theorem 1.1.9 that we can choose a 
C” function w with kx k matrix values so that 


K(x', x”) = (W(x) x’, Kz (x). 


Then (0, x")=x;, ,(0,x”) is non-singular. Since u, and u are in C® when x’ 
+0 we may shrink X so that this is true in all of X. Now we have 


u(x) = |det x'(x)|* (2m) * 744 f eh HO a (19 (x), dn! 
= (20) t2WI4 FEED (ic 4(x), W(x)? SY det x’(x)|F/Adet w(x) dé’. 


Here we have put 7’='y(x)~!& in the oscillatory integral. It follows from 
Lemmas 18.2.1 and 18.1.18 that u is of the form (18.2.11) with 


(18.2.14) a(x", c’) =e P8? a, (ie 2(x), W(x) * 6) [det x'(x)[?/Idet Wl =o. 


Modulo S"*+“~2/4-1 this is given by the first term in (18.2.5), that is, 
a, (2(0, x”), 'W(0, x”) * &) [det «’(O, x”)|* [det WO, x”)|-?. 


Since |det x’(0, x”)| =|det yO, x”)| |det x, (0, x”)| because x’(0, x”) is triangular, 
we obtain (18.2.13). 


The half density a(x”, €’)|dx’”|? |dé|? on the conormal bundle N(Y) of Y 
should be considered to have order m+n/4 when aeS"+"/4-*/2, In fact, a 
function f on N(Y) is homogeneous of degree u if M*f=t"f, t>0, where 
M,(x", €)=(x",t€) denotes multiplication by t in the fibers. If we define 
homogeneity of a half density f by the same condition then the half density 
|dx”|* |dé|? corresponding to a=1 is homogeneous of degree k/2 where k is 
the number of ¢’ variables. 


Definition 18.2.10. Let V be a real C® vector bundle of fiber dimension k 
over a C” manifold Y. Then the space S*(V,Q+) of half density valued 
symbols on V of order p is the space of half densities which in a local 
coordinate patch «: Y,,— Y,.C R? with local coordinates yelR? take the form 
a(y,n) |dy|* |dn|* with aeS*-*/?(¥, x R*) if V is identified with Y, x R*. 


With this terminology Theorem 18.2.9 means that the correspondence 
a+ u given by (18.2.11)’ gives rise to an isomorphism 
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smn4(N(Y); OF y)/S"*"/4- 1 (N(Y); Qi) 
— I™(X, Y; QB I"-1(X, Y: 24). 


There is no difficulty in extending this result to distributions with values in 
other vector bundles than 2% because multiplication of u by a transition 
matrix will just affect the symbol in the same way. Thus we have: 


Theorem 18.2.11. Let X be a C® manifold, Y a closed C® submanifold and E 
a C™ complex vector bundle over X. Then there is an isomorphism 


S"*"/4(N(Y), Qh) @ B)/S"*"!*—*(N(Y), Qi @ EB) 
— I"(X, Y; Q$ @ E)/I"~1(X, Y; QE @ E) 


defined locally by (18.2.11). Here E is the lifting of the bundle E to N(Y) (the 
fiber of E at (y,n) is equal to E,). The image under the inverse map is called 
the principal symbol. 


Note that we have half density bundles on different spaces here. This is 
why it is convenient to factor out a half density bundle from the beginning. 
Also note that the codimension of Y has now disappeared. The presence of 
the term n/4 in the degree of the symbol:is caused by our insistence on 
agreement with the degree of pseudo-differential operators. In that case we 
did not have a half density as symbol. However, the normal bundle of the 
diagonal in X x X is isomorphic to the cotangent bundle in X so it has a 
natural density defined invariantly by the symplectic form and given in local 
coordinates by |dx||dé|. Hence there is a natural half density |dx|* |dé|* of 
order (dim X)/2=dim(X x X)/4. When the function defined as principal 
symbol in Section 18.1 is multiplied by this half density the order is raised as 
in Theorem 18.2.11. In general there is no natural half density in N(Y) 
permitting us to identify half densities with functions. 


By Theorem 18.2.7 we know how pseudo-differential operators act on 
I™(X, Y; E). We shall now determine this operation more explicitly on the 
symbols. To do so it is of course sufficient to work locally, with a trivial 
bundle. 

Thus we assume that ueé’(IR”) is of the form (18.2.11)'. Let p be in 
SIR" x xR’). To compute p(x, D)u we first observe that Fourier’s inversion 
formula gives 


a(6)=(2n)fe- "+ 2BI4 fay”, Ee HED dy”. 
Hence we obtain, taking p and a in F at first, 


ple, D) u(x) =2apk*- 42804 ff oD ple, Caly’, Cet? dE dy" 
Ba seF nt tea Gag yal 
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where 
a(x, £))=(2mjt-* [J f"-"8 pw, aly’) de" dy” 
= el Dy De) n(x, Ea(y", Oly ax’ e720 


by the calculations preceding Theorem 18.1.7. The proof of Theorem 18.1.7 
gives with no essential change that a,eS™*™ *"/4-*/? and that 


a,(x, )~)) iD, De» p(x, aly, EVI Nye ee noe 
Hence it follows from Lemma 18.2.1 that 


(18.2.15) p(x, D) u(x) = (22) - 7 24/5 [ 8 B(x”, ENE’, 
(18.2.16) B(x", 6’) = ef Py Per — Par 8) n(x, Eh aly", E Nyaa xen 0 
— VKiD,-, Dey = <iD,, D>) p(x, é) aly”, evi oo xx’ = E"= 0° 


We have now proved 


Theorem 18.2.12. If ueI"(X, Y;E) and Pe¥" (X;E,F) is properly supported, 
then PueI™*™ (X,Y; F) and the principal symbol is that of u multiplied by the 
restriction to N(Y) of the principal symbol of P. If u has compact support in a 
local coordinate patch where Y is defined by x'=0 and u is given by (18.2.11Y, 
then the complete symbol of Pu is given by (18.2.15), (18.2.16) where p is the 
complete symbol of P. 


So far we have made no comments on the polyhomogeneous case. 
However, it is perfectly clear from the formulas (18.2.14) and (18.2.16) that 
everything said is applicable then since the step is the reciprocal of an 
integer. We shall need this remark in what follows. 

In the theory of boundary problems one encounters the following situa- 
tion. X is a C® manifold, Y is the closure of an open subset of X with C” 
boundary éY. Let E, F be vector bundles on X and P a properly supported 
operator in Py,(X; E,F). We want P to induce a map C%(Y, E)> C®(Y, F) 
which is not always the case. 


Definition 18.2.13. P is said to satisfy the transmission condition with respect 
to Y if for every ueC™(Y, E) the restriction of Pu, to the interior Y° of Y is 
in C(Y°, E), that is, has a C® extension to X. Here uy=u in Y and uy=0 
in XN Y. 


We want to determine the conditions on the symbol required for P to 
satisfy the transmission condition. The question is obviously local so we 
consider the case where X CR", Y is defined by x, 20, and E, F are trivial 
bundles. We assume also that the support of u is compact in Y. Then 


(18.2.17) Uy (x) =(2m)~* f ef***1 a(x", Ede, 
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where x” =(x,,...,X,) and 


(18.2.18) a(x”, é,)= fu(xje"™dx,~ —i> Ep *—* Di u(0, x”) 
0 0 


is in S5,,(R"~' xR’). The asymptotic expansion follows by repeated partial 
integrations of course. Thus u,é15,7*?/4(X,0Y). In the symbol a the terms 
of odd (even) order are odd (even), and by appropriate choice of u we can 
get modulo S~~ any symbol aeS3,,(IR"~' xR’) with this property and 
compact support in x”. Now assume that Pe}, where m is an integer. 
Then Pu is of the form (18.2.17) with a new amplitude beS™,,* given by 
(18.2.16). Without affecting the asymptotic expansion of the symbol we can 
multiply by a function in C?(X) which is 1 in a neighborhood of supp u to 
make the support compact. We shall now prove a lemma which is closely 
related to the Paley-Wiener-Schwartz theorem (cf. Theorem 7.4.3). 


Lemma 18.2.14. If ve€'(X), X cR", and 
v(x)= fe d(x", €) dg, 


with beS#, (IR"~'! x IR‘) for some pe, that is, 


phg 
b~D bx", 4) 
0 


where b, is homogeneous of degree w—j, then v|,,,9 has a C® extension to 


the closed half space x, 20 if and only if for every j 
(18.2.19) bx", —I)=b,(x", let@—9. 
This means that b; is the restriction to R\ 0 of bj(x”, EP TS with ce a equal to 


1 at 1 and analytic in the upper half plane. 


Proof. a) Sufficiency. Let I be the curve in € consisting of the real axis with 
(—1,1) replaced by the half unit circle in the upper half plane. Then 


v— YS feb x", DG -Fdlec” 


I<NT : 
if N>Rep+v+l. Writing the terms in the sum as 


Di fe B(x", EIKdC, 
r 


with Re »—j—k< —1 and using Cauchy’s integral formula we find that they 
vanish when x, >0. Hence all derivatives of v are bounded when x, >0. 

b) Necessity. By Borel’s theorem (Theorem 1.2.6) we can find weC9(X) 
equal to v when x, >0. Then 


byl(6,)=(2m)~* Cv —we 8 b(x"),  PECH({xEX 5 x, =0}), 
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iS an entire function and 
lbg(CISCU+IC)"%, Iml,20, 


by the Paley-Wiener-Schwartz theorem (Theorem 7.3.1). On the real axis we 
have by(té,)t~" +b} (é,), t> + 00, where 


DS (E )=f Box", €,) P(x") dx". 
Hence 
by(C,)ozMi+el,)- MM 


tends to 0 at oo in the upper half plane and has a bound on I which is 
independent of s. By the maximum principle it follows that there is also a 
fixed bound above I’. Thus 


Ibg(C IS CIC, IC,j>1, Im, 20. 
Now we can choose t,-+ + 00 so that 
Bo ,)=lim by (t, ¢,) t7" 
exists and is analytic in the upper half plane. The boundary values on the 
real axis are b$(é,) in view of say (3.1.13) with N=0. Hence B(f,)=b$(1) Cf, 
Osarg¢, <7, which proves that b, satisfies (18.2.19). By part a) of the proof 
we can now subtract a distribution corresponding to b, and vanishing when 


x,>0 and then conclude that b, satisfies (18.2.19) and so on. The proof is 
complete. 


Suppose now that Pe ¥,,(X) and that the symbol has the expansion 


yi; 

0 
where p; is homogeneous of degree m—j. Then Puy is of the form (18.2.15) 
with b given by (18.2.16) and a defined by (18.2.18) If P satisfies the 
transmission condition with respect to the half space x, 20 it follows from 
Lemma 18.2.14 that the principal part p,(0,x”,€,,0)u(0,x)/ié, must satisfy 
(18.2.19), se 

Po (0, x”, —1,0)=e*'" p,(0, x”, 1,0). 


Note that this condition would still have followed if we had weakened the 
transmission condition by demanding in Definition 18.2.13 that u vanishes of 
some fixed order on GY. This weakened transmission condition remains 
valid if P is multiplied to the right or to the left by any differential operator, 
for Duo will not contain any terms supported by the plane x,=0 if u 
vanishes at least of order k. In particular, the commutators of P with D; and 
x; any number of times satisfy this weakened transmission condition, so it 
follows in view of (18.1.6) that for arbitrary a, G we have 


(18.2.20) pe), (0, x”, —1,0) =e" l2) pO, (0, x”, 1,0). 
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Conversely, (18.2.20) guarantees in view of (18.2.16) and Lemma 18.2.14 that 
the operator with symbol p, satisfies the transmission condition. Subtracting 
it from P we conclude that p, has the same property and so on. Hence 


(18.2.20)' p%p(0, x", —1,0) =e%-#I-J p (0, x”, 1,0) 


is a necessary and sufficient condition for P to satisfy the transmission 
condition. 
To put (18.2.20)' in an invariant form we introduce for peS” 


P~S Dp 
ce) 


where p, is homogeneous of degree m—j, a new symbol p such that 


(X x IR"), 


phg 


(18.2.21) B(x, €) nS e-mim—D p,(x, —€). 
0 


Clearly p—e-?*™peS-~ so prre™™p defines an involution of 

Shng(&  IR")/S~°(X x IR"). It gives rise to an involution of PY (X)/P~ (xX) 
for any C®” manifold. Indeed, since g=q if q is any polynomial, we obtain 
with the notation of Theorem 18.1.17 


(P,) (2), ~Y p(x, —'e’'(x) ne" -F “HO G(x, n/a} 


which is equal to the expansion of (p),(«(x),y). A similar calculation using 
(18.1.15) shows that we have an involution of the algebra Pong X)/P— (XxX) 
and that it commutes with taking adjoints as well. No change is required if 
bundles are present. We have now proved 


Theorem 18.2.15. A properly supported pseudo-differential operator 
Pe P,(X ; E, F) satisfies the transmission condition with respect to the closure 


Y of an open subset with C” boundary if and only if 


(18.2.22) the symbol of P—P vanishes of infinite order on the interior 
conormal bundle of GY. 


The vanishing condition must of course be worked out_in local coor- 
dinates but it is independent of how they are chosen. Since P=e-?7i™ P we 
obtain from (18.2.22) the equivalent condition 


(18.2.22)’ the symbol of P —e?*™ P vanishes of infinite order on the exterior 
conormal bundle of OY. 


The orientation of OY as a boundary is therefore irrelevant when m is an 
integer. 

The proof of the sufficiency of (18.2.20) shows that Pu admits a C” 
extension from Y° to Y also if u is a smooth simple (or multiple) layer on 
OY. We shall now study the boundary values of Pu in that case. Again we 
may work locally so we assume that X =IR", that Y is defined by x, 20 and 
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that P=p(x,D) where p~)'p, for some p,; homogeneous of degree m—j 
satisfying (18.2.20). Let u=6(x,)@v(x”) where veC@(IR"~*). To compute 
Pu we take geCp((—1,1)) with fo@dt=1 and note that Pu=lim Pu, if 
u(x) = (x;/e) v(x")/e. Now 


Pu,(x)=(20)-"fe'® p(x, 2) be E,) HE) dé, 


and we shall prove that the integral with respect to €, has a limit when 
e>0. 


Lemma 18.2.16. Let q(t), teIR, be a continuous function and assume that there 
is an analytic function Q(t) in Qep={teC; ImtZ=0, |t|2R} for some R, such 
that Q(t)=O(t®) for some N when t> co in Qz, and q(t)—Q(t)=O(t-*) when 
t->0o on R. Then 


(18.2.23) {* aa | IO - QO) det J ade 
~Fo(Re'”) Ried 


is independent of the choice of Q. If F(t,s) is an analytic function of t when 
Imt20, for OSs<1, and F is a bounded continuous function of (s,t) then 
J* q(t) F(t,s)dt is a continuous function of s. 


Proof. If q=0 it follows from the maximum principle that 
sup |t? Q(t)(1 —iet)""~ >| <sup|t? Qo), 
QR OQR 


for (t—iet|21+elmt21 in dQ. Letting e+0 we conclude that t? Q(t) is 
bounded in Qp, so it follows from Cauchy’s integral formula that 


J Qdt=0. 

OQR 
This proves that (18.2.23) gives a unique definition of {* q(t)dt. The last 
statement follows at once by dominated convergence after Q(t) F(t,s) has 
been subtracted from q(f) F(t, s). 


Remark, If q is a rational function with no real pole, then the hypothesis of 
the lemma is fulfilled with Q equal to the sum of the terms of degree 2 —1 
in the Laurent expansion at infinity. Since g(t)—Q(t)=O(t~7) at infinity in 
C, it follows from Cauchy’s integral formula that {* q(t)dt is equal to 27i 
times the sum of the residues of q in the upper half plane. 


Let us now return to the boundary values of Pu. For fixed €” and large 
€, we have when €, — 00 


P(x, 8) — Yo et Fl p(x, 1,0) 2"%/a! =O(E 77) 
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if we sum over all j and « with «,=0 and j+|a|<2+Rem. This follows if 
we write p(x, 2)=|E "p(x, €,/Es|, @”€,), expand by Taylor’s formula 
and apply (18.2.20). We have 


je™5 d(eE N<floldt, Im, 20, 
provided that e<x,. Hence it follows that 
fel p(x, & Ple€,)dz, > [+ e™** p(x, HdE, 


when ¢—0, and the integral can be bounded by a power of (1+]é|) 
independent of ¢ and x,. Letting x,-0 also now, we conclude that 
Pu(x)— q(x”, D") v(x") when x, 0, where 


q(x", ")=(2n)-* |* pO, x", €,, 6") dé,. 
We have q~)/q; where 
(18.2.24) g(x", Os (22)- 1 ig p,(0,x"561,E") dd, 


is homogeneous of degree m+ 1-—j. Indeed, choose N so that N>Rem+2 
and set 
p= > Dit Ry. 


G<N 
Then we have RyeS®°"—*% when |é”|>1, say, and it follows that 


IDE. Dif Ru(O,x", 81 .E)4E 1S Cop fF E +E DRM Na 
SCag(h Eee 8, 


This proves that q~}'q j, Summing up, we have 


Theorem 18.2.17. Assume that p~))p,¢S?,,(IR" IR") and that the trans- 
mission conditions (18.2.20) are fulfilled. If veCZ(IR"—*) and u=6(x,)@ v(x") 
then p(x,D)u has a C™ extension from the half space {x € IR"; x; > 0} to its 
closure, and 


lim p(x, D) u= q(x", D") v(x") 
xi1J0 


where q~¥ q,6S%0'(IR"~' x IR"~*) with q, defined by (18.2.24). 

The boundary values of D* p(x, D)(Diu) are of course also given by 
pseudo-differential operators acting on v; we just have to apply the theorem 
to the pseudo-differential operator D* p(x,D) Di, the symbol of which is 
given by the calculus. 


A crucial point in the proof of Theorem 18.2.15 was the fact that functions 
in X with support in Y and with C© restriction to Y can be identified with 
the elements in 15,%*+?/*(X,0Y;E) with support in Y. For any complex 
number yp we can define the closely related space 


(18.2.25) CP(Y, E)= {uelby”-?"*(x, oY; E), suppuc Y}. 


18.2, Conormal Distributions t11 


(We have omitted X in the notation since it does not matter how the 
manifold with boundary Y is extended to an open manifold X.) If u has 
support in a local coordinate patch where Y is defined by x, 20 and E is 
trivial, then we can write 


(18.2.26) u(x)=f e'™5 a(x", €,) dé, 
where 
(18.2.27) a~yYia,(x", &,) 

0 


and a, is homogeneous of degree » —j. If we change the signs of x, and of 
€, it follows from Lemma 18.2.14 that a,(x”,¢,) can be extended to a 
homogeneous analytic function of €, in the half plane Imé, <0. In view of 
Example 7.1.17 it follows when yp is not an integer that u has an asymptotic 
expansion 


wl)~ Due @ xh ujeC?(IR"~*). 


Here we have used the notation of Section 3.2 and the expansion means that 
the difference between u and a partial sum of high order is as smooth as we 
please. If uw is an integer <—1 we obtain the functions vanishing when 
x, <0 which are in C® and O(xy;"~"') when x,20. Finally, when y is an 
integer 20 then uw is the sum of a function U, which is in C® when x,20 
and vanishes when x, <0, and a multiple layer 


Y 4X") @ 6P(x)), 
isu 


where uje C°(IR"~'). There are boundary problems for which one expects 
the solutions to behave as in one of these cases. 

With Y still denoting the closure of an open subset of the C” manifoid 
X with GYeC™”, we can now extend Theorem 18.2.15 as follows. 
Theorem 18.2.18. Let Pe'¥y,(X;E,F) be properly supported and assume that 
for every ueCP(Y,E) the restriction of Pu to the interior Y° of Y is in 
C°(Y°, F). Then the symbol of 


(18.2.28) P —e?%'# p 
vanishes of infinite order on the interior normal bundle of GY and conversely. 
Proof. We can follow the proof of Theorem 18.2.15 closely, working again in 


local coordinates. If u is of the form (18.2.26), (18.2.27) with a, homogeneous 
of degree p —j, then 


a,(x", l= eu J) a,(x", —1) 


by Lemma 18.2.14 with the sign of x, changed, for u vanishes when x, <0. 
Now Pu is of the form (18.2.26) with a replaced by a polyhomogeneous 
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symbol with the leading term po(0,x",¢,,0)a9(x”,¢,). Hence it follows from 
Lemma 18.2.14 that if PueC”(Y°) then 


Po(0, x”, —1, 0) ag(x”, —1) =e" t pO, x”, 1,0) ag(x”, 1) 
=e™™+ 2) D0, x”, 1, 0) ag(x”, —1). 
Since we can choose ad, so that a,(x”, —1)+0 at any point, it follows that 
Po(0, x", —1,0)=e7!"*2") pn (0, x”, 1, 0). 


Following the proof of Theorem 18.2.15 we can now obtain the correspond- 
ing condition for arbitrary derivatives of py or of the lower-order terms. The 
repetition of the details is left for the reader. 


Remark. It follows from Theorem 18.2.18 that the modified transmission 
condition examined in the theorem only depends on the residue class of py in 
C/Z. If (18.2.28) is valid then the symbol of 


e72nim+H) p_ p 
vanishes of infinite order on the exterior normal bundle. If 
(18.2.29) m+pt+pez 
it follows that 
P—e2"i"' p 
vanishes of infinite order on the exterior normal bundle. Hence we may 


replace Y by y'=[Y if uw is replaced at the same time by some y’ satisfying 
(18,2.29), 


18.3. Totally Characteristic Operators 


This section is devoted to the study of a class of pseudo-differential oper- 
ators in a C” manifold X with boundary 0X. (This notion is defined in 
section B.2 of Appendix B.) In Section 18.2 we introduced the transmission 
condition with respect to X for a pseudo-differential operator P in an open 
manifold of the same dimension containing X. This condition guarantees 
that P defines a map from CP(X) to C*(X) but P does not restrict to an 
operator from C7(dX) to C°(d6X) which is sometimes desirable in the study 
of boundary problems. A first order differential operator L has this property 
if and only if it is tangential to 0X (cf. Definition 18.2.6). The algebra of 
operators which we shall define is built up from such first order differential 
operators in the same way that standard pseudo-differential operators are 
built up from general first order differential operators. 

As a model for a manifold with boundary we shall use the closure R", 
of the half space R’, ={xelR"; x, >0}. As explained in Section B.2, if F is a 
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space of distributions in IR" we shall use the notation F(IR",) for the space 
of restrictions to IR". of elements in F and we shall write F(IR".) for the set 
of distributions in F supported by R%. The space C°(IR",.) is by Theo- 
rem 1.2.6 identical to the space C°(IR".) of C® functions in IR’. and we shall 
use both notations. 

According to Lemma 18.2.5 the first order differential operators in R" 
which are tangential to dR" are generated by the operators 0/@x pi<n, and 
x, 0/Ox,. We can extend Lemma 18.2.5 as follows: 


Lemma 18.3.1. The algebra Diff,( IR") generated by the first order differential 
operators with coefficients in C™(IR".) tangential to COIR", consists of the 
operators of the form 


P=) a,(x) x3" D* 
where a,éC (R",). 
Proof. Since x,D,=D,x,+i, the operators (x,D,)/, jSk, are linear com- 


binations of the operators x/ D/, j<k, and vice versa. The lemma is there- 
fore a consequence of Lemma 18.2.5. 


The elements of Diff,(R"%) will be called totally characteristic. Lem- 
ma 18.3.1 suggests extending the subspace Difff(IR".) of operators of order 
<m to a class of operators defined by 


(18.3.1) a(x, D)u=(2n) "fe a(x, ade, ueCe(R"), 
(18.3.2) G(x, 2)=alx, €, x, €,), 
X,20, C= d,-)s €0.Q=6 ~«,<0. 


Since our primary concern is the behavior as x,->0 we shall choose a in a 
symbol class S" defined so that @ will satisfy (18.1.1) for x,>1: 


Definition 18.3.2. By S% we shall denote the set of all ae C?(IR%. x IR") such 
that for all multi-indices a, B and all integers v20 


(18.3.3) [ah(x, 1S Cap, A +1D™- ML +x)7% xeIRG, EER" 
The kernel of (18.3.1) is the inverse Fourier transform 
(18.3.4) K(x, y)=(2m) "fel" a(x, dé; (x, y)elR?"; 


defined in the sense of Schwartz. It is a continuous function of x with values 
in /'(R}) when x,20 and vanishes when x,<0. We want (18.3.1) to depend 
only on the restriction of u to IR", so we must require that y,20 in supp K. 
This means that the oscillatory integral 


j gi in—Yn)on a(x, Ee Xn E)dé, 
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which is well defined when x,20 and y,<0O must vanish then. Taking x, ¢, 
as a new integration variable when x,>0 we write this condition in the 
form 


(18.3.5) fe a(x, €,€,)dé,=0 if ts —1 and x,20, 
that is, the Fourier transform with respect to €, must vanish when t< —1. 
Definition 18.3.3. We shall say that aeS" is lacunary or satisfies the lacunary 


condition if (18.3.5) is fulfilled. The set S7, of all aeS” satisfying (18.3.5) is a 
closed subspace of S", thus a Fréchet space. 


The lacunary condition is not very restrictive, for just like the condition 
of proper support it only affects the residual part of the symbol: 


Lemma 18.3.4. Let peS(IR), p=1 in a neighborhood of 0 and supppc 
(—4, 1). If aeS™ it follows that 

a,(x,2)= J a(x,é,€,—1) p(tdt 
is in Swi, we have a—a,eS;,°, the map S" 3ar+a~a,eES>” is continuous, and 
X,/2SY,S2x, in the support of the kernel of G,(x, D). 


Proof. The Fourier transform of a, with respect to ¢, is the product of 6 
and that of a, so we obtain not only (18.3.5) but a stronger condition 


(18.3.5) fe a,(x,,6,)de,=0 if t¢(—4, 1). 


’ This implies that x,/2<y,S2x, in the support of the kernel of G,(x, D). 
Now the condition on p implies that j pdt=1, {t/ pdt=0, j>0. Hence 


a(x, 6) —a(x, C)= { (a(x, 2,6, — )) A a(x, 6)/0e(—tH/j!) p(t) dt 
-@ I<N 


for any N. By Taylor’s formula the integrand can be estimated by 
Cy yeh +E)" "+x, "lel if lel <1E1/2. 
Evaluating each term separately we obtain the bound 
Cy wld + eX +x," oO 
SC, (1 Hf)" MA +x," + 24e)? OF [p(a, |e] > 11/2. 


Since peY and N is arbitrary this shows that a,(x,¢)—a(x,¢) can be 
estimated by any power of (1+|é|)-‘(1+ x,)~'; the constants obtained are 
semi-norms in S". This is also true for all derivatives of a,—a since 
convolution with p commutes with them. The proof is complete. 


From the lemma it follows that 


Sia/ Sia” = S/S". 
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The restriction to lacunary symbols will therefore have no effect on the 
symbol calculus which we shall develop along the lines of Section 18.1. First 
we give an analogue of Theorem 18.1.6. 


Theorem 18.3.5. If aeSt, and ueS(R’,) then (18.3.1) applied to any extension 
of uin SIR") defines a(x, D)ue S(IR",), and the bilinear map (a,u)+> G(x, D)u 
is continuous in these spaces. We have 


(18.3.6) [a(x, D), Dj] = i dgy(x, D) + i 5, a(x, D) Dn, 
[a(x, D), xj) = —i a(x, D) — i bin Xn AMX, D). 


For any integer k20 


k . 
(18.3.7) Dé a(x, D)ul,o= ¥. ") a, (x',D’)(Diul,, 9), 
jSk 


a, (x, €)= ¥. ( ) Dk-i Di. a(x’,0, €, )eS™0R"-! x IR"). 


isj 


Proof. The lacunary condition guarantees that y,20 in the support of the 
kernel K. Hence (18.3.1) is independent of the extension chosen. Since ie S 
it is clear that d(x,D)u is a C®” function with all derivatives bounded. 
Differentiation under the integral sign or integration by parts gives (18.3.6). 
As in the proof of Theoremi8.1.6 it follows then that the map 
S™ x A(R" )a(a,uea(x,D)u is continuous with values in A(R"). 
When we differentiate with respect to x, under the integral sign in (18.3.1) a 
factor €, appears when the derivative falls on the exponential function or 
the last argument of a(x, é’,x,€,). When there are j such derivatives altog- 
ether we obtain (18.3.7) since 


(2m)~* | Efa(2,6,) de, 


is the Fourier transform of D/ u(x’,0) with respect to x’. 


(18.3.7) shows that the purpose of the definition of the operators G(x, D) 
has been achieved: all normal derivatives of a(x, D)u on the boundary can 
be calculated by letting pseudo-differential operators in the boundary act on 
the normal derivatives of at most the same order. We shall now show that 
the main structure of the calculus of pseudo-differential operators is pre- 
served in spite of the fact that a(x,&), defined by (18.3.2), has rather bad 
symbol properties with respect to x,. One of the main differences is that the 
kernel of &(x,D) may have some singularities even when aeS~~. We shall 
therefore examine such operators now, using the notation 


Q={(x, y)ER?"; x,20, y,20}, 6,0={(x, y)ER, x,=y, =0} 


for the quaterspace containing the support of the kernel K and for its 
distinguished boundary. In Q it is convenient to use the symmetric singular 
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(—2,0) r 


Fig.2 


(polar) coordinates with respect to x, and y, given by (see Fig. 2) 
(18.3.8) t=(x,+y,)/2, r=(xX,—y,/t=2(x, —y MX, +Y,)s 
that is, x,=¢(1+r/2), y,=t(1 —r/2). Note that t20 and |r| <2 if (x, yyeQ. 
Theorem 18.3.6. If aeS;,~ then the kernel K of &(x,D) is in Lj,,(QR?") and in 
C~™(R2"~ d,Q), supp K <Q, and 
F(x’, y',t,r)=t K(x’, t(1+1r/2), y’,t(1 —1/2)) 
is in C° when t=0, equal to 0 when |r|22. For all a, B, t, p, v we have 
(18.3.9) |D%- D8, Di D? F(x',y,t, lS Cagepy(1 +1x’ —y'| +0)”. 
Conversely, if KeEL},,(R?"), supp K <Q, and F has these properties, then K is 


loc 


the kernel of G(x, D) for some aeS;,”. 


Proof. The inverse Fourier transform 

(18.3.10) A(x, y)=(22)-" fel a(x, E)dé 

is a C® function when x, 20. The proof of Lemma 7.1.3 gives 
(18.3.11) [Dz DS A(x, SC rpv(l tly) “A+x,)-%, x, 20, 


for all a, 8,N and shows that conversely every A satisfying (18.3.11) with 
A(x, y)=0 for y,>1 is of the form (18.3.10) with aeSj,°. The kernel K is a 
continuous function of x with values in Y when x, 20, and (18.3.4) gives 


(18.3.12) K(x, y) =A(x, x’ —y',(%,—Y_)/XpWXqo Xp >0. 


Hence KeL},,(R7"), supp K <Q and KeC® when x,>0. If t=0 and r> —2 
then 


(18.3.13) F(x’, yt.) =2A(x’, t(1+r/2), x’ —y’,2r/(2+n)/(2+r). 
This is a C® function. With x,=t(1+7r/2) and y,=2r/(2+r) we have 
1+|y,/22/(2+n, 14+'4y,|+x,22¢. 


We can therefore estimate F by any power of (1+r/2)(1+|x’—y'|+0)~!. The 
same is true for all derivatives of F since these are sums of terms of the 
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same form as F but with A replaced by a derivative and powers of t and 
1/f(2+r) as factors. If we set F(x’, y’,t,7)=0 when rs —2 it follows that 
FeC® when t=0 and that (18.3.9) holds. 

Conversely, assume that KeL,,(IR2"), that supp K cQ and that FeC® 


loc 


when t20, the estimates (18.3.9) being valid. Since t(1+r/2)=x, and 
2rf2+r)=y, implies r=2y,/(2—y,), t=x,(2—y,)/2 if x,>0 and y, <2, we set 


(18.3.13Y A(x, y)=2 F(x’, x'—y’,x,(2—Y,)/2, 2 V2 —Y_) (2 — Yn), 


y, <2, and A(x, y)=0 when y,>1. The two definitions agree in the common 
domain since 2y,/(2—y,)>2 when 1<y,<2. When y,<1 we have 


2y,/(2—y,) +2=4/(2—y,) S81 +ly,h), 
1+x,(2—y,)/221+x,/2. 


Note that since F=0 when r< —2 we can use Taylor’s formula to streng- 
then (18.3.9) to 


(18.3.9) |Dz, Dy, Di Df F(x’, y’,t,7)| 
SCy ppv +|x’ —y'l+t)7 (+2). 


aptpy 


This shows that (18.3.11) follows from (18.3.9) and completes the proof. 


Remarks. 1. In terms of the notions introduced in Section 18.2 the condition 
FeC® means precisely that Kel5"7(R", 0, Q). We leave the simple verifi- 
cation for the reader. 
2. The proof shows that if aeS;,"~? then 

(18.3.14) |K(x, y]S CL +1x’ —y')-" 1x, Val X nl +1)? 
In fact, since (1 +|y’))"(1 +ly, |)? |A(x, y)] SC we have 

(1+ |x’ —y' "1K (x, yl S CCL 41%, —Yal/ Xa) 7/ Xp = CX p/(X q+ Vn XQ) 

S8Cx,y,/(X,+y,)> if O<x,Sy,. 

In addition (1+{y'|)"|A(x, y/SC{l—y,|, for A(x,y)=0 when y,>I1 and 
(1+ly'l)" |CA(x, y)/ey,| SC. Hence 

(L+|x!—y DIK(x, WS Cyq/X2 S8CX, Yelp tY,)? — if O<y,<X,. 


This proves (18.3.14) which will be a convenient starting point for the proof 
of L? estimates below. 


It follows from Theorem 18.3.6 that if aeS7,~ then G(x, D)*=b(x, D) 
where beS;,*. To prepare for the proof of the analogous result for S7, we 
shali now give an analytic expression for b when aeSj,~ and the strong 
lacunary condition 


(18.3.5)” fet a(x,€,€,)dé,=0 if t¢(—1,4) 
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is fulfilled. With the notation (18.3.10) this means that A(x, y)=0 when y,¢ 
(—1,4), thus 4S|x,/y,| <2 if (x, y)esupp K. By (18.1.9) and (18.1.10) we have 


(18.315) (@(x, D) u, v)=(u,b(x, D)v); — u, ve SAR"); 


where b= e'<)» P® G(x, 2)e FY’ and x,20 in suppb. If ¢¢C7(IR) and ¢=0 in 
(00,1), P=1 in (2,00), then $(x,/e) G(x, C)= b(x,/e) a(x, &, Xn FES” * and 
converges to din Y’ as e>0. Hence 6 is the #’ limit as e—0 of 


(x, ErRo(20)-" [fe F-"F-”? H(y,/e) Ay. Vata dy an 


defined by Theorem 7.6.5 or interpreted as an integral first with respect to 7, 
then with respect to y. The integral with respect to 7, vanishes by (18.3.5)” 
unless (y, —x,,)/y,€(—1,4), that is, $<y,/x,<2. Choose yeC?(4, 3) equal to 1 
in (3,2). Without changing the integral we can insert a factor x(y,/x,)- 
Letting ¢ 0 we then obtain when x,>0 


b(x, &, x, €,) = 5(x, €) 
=(2n)-" ff en FP a(x —y, & —1', (Xn —Yn) (En ~ Ma) Xn — Yn)/Xn) dy an. 


Yn<Xn 


The integral exists as a repeated integral. Replacing y, by x,y, and y, by 
n,/X, we conclude that 


(18.3.16) (x, é) 
=(2n)~" [fet a(x’ —y',x, (1 ~y)s € —',(1 — yn) En — 0) KO —y,) dy dn 
or ef Py Pad (a(y’, x, Yao 1’; Yn Nn) LOMy= &', L.q= 


We shall now study this formula when aeS") for some finite m. 


Lemma 18.3.7. [f acS" and yeC? (0, 0) is equal to 1 in a neighborhood of 1, 
then (18.3.16) defines a symbol beS, such that 


1 : i zs3! , 
(18.3.17) b(x, C)~ 25 <D, iD,» ay Xn Vn Vn Nip este lyse 


i ston ng Vs, 
=Liy <D,, iD,»’ aly, > Yn Hip "= San = SnlXn" 
The map S" 3a+> beSj, is continuous. 
The asymptotic sum is well defined in the sense of Proposition 18.1.3 for 
if a y derivative falls on the argument y,y, it causes the degree of a to 


decrease by one unit which compensates for the appearance of a factor y,,. 


Proof. There is a constant M such that supp yc(M~',M). If we set 
Cxy(Ve = BY, Xn Ino 1's Vn Mn) XVn) 
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it follows from the bounds 1/Msy,<M _ in the support that (1+ x,)’c,. 
belongs to a bounded set in S” for every v and x, 20. By Theorem 18.1.7 
Cy, 9, X,) =e??? c, (y,m) 
is a C® function of (y, 4), and we have for all «, B,v 
\D3 DE Cy, 0, XMS Cap (1 ig” +x)”. 
Since the derivatives of c, with respect to x, are of the same form it follows 
that C is a C® function of all variables and that 
[Di Dz DE C(y, 9, XMS Cag (1 + tly" +-x,)-”. 


Hence b(x, €)= C(x’, 1, &,x,) is in 8%. The asymptotic expansion also follows 
at once from Theorem 18.1.7 since y=1 in a neighborhood of 1. If aeS;° 
we obtain by taking ¢,—7, as new integration variable in the first form of 
(18.3.16) 


J b(x, dei dé, 

= (2m) " [fet + a(x’ —y/, x, (1-0), & —n',(1—t)n,) x1 —t) dy dn 
which is equal to 0 unless M~'S1—t<M, that is, -1-M<t<1-M~". If 
aeS" we obtain the same conclusion by taking a,eS,;~ converging to a in 
S"*?, Thus beS7, which completes the proof. 


Remark. That the lacunary condition holds is no surprise since x has the 
effect of cutting off the kernel of G(x, D) by a factor x(x,/y,). 


Theorem 18.3.8. For every aeS™ the adjoint of G(x, D) is equal to b(x,D) for 
some beSf, in the sense that (18.3.15) is valid. If a satisfies the strong 
lacunary condition (18.3.5) and y¢C@(0, 00) is equal to 1 in (§,2) then b is 
given by (18.3.16) and has the asymptotic expansion (18.3.17). 

Proof. If aeSj;,* the first statement follows from Theorem 18.3.6 for 
(x,y) +» K(y,x) has the same properties as K. Thus the first statement follows 
from the second one and Lemma 18.3.4. If aeSy, and (18.3.5)” is fulfilled we 
choose a,eS;° so that aj>a in S"*' as joo and define a,, as in 
Lemma 18.3.4. Then a,,¢S;,” satisfies (18.3.5)” and a,,>a, in S™*1 as 
joo. Let beS7,° and byeS7, be defined by (18.3.16) with a replaced by a,, 
and by a,. Then 


(G,(x, D) u,v) =(u,6,(x,D)v);  u, ve; 
and b,—b, in S7,* as j-> 00. Hence it follows from Theorem 18.3.5 that 
(4,(x, D) u, v)=(u, by (x, D) v); u, ves. 
Since a — a, € S7™ satisfies (18.3.5)” we also have with b — by € S,° 
((a(x, D) ~G,(x, D))u, v) =(u,(6(x, D)— by (x, D)) v); u, ve S; 
which gives (18.3.15) and completes the proof. 


120 XVIII. Pseudo-Differential Operators 


By Theorem 18.3.5 d(x, D) is a continuous map # (R".) > A(R"), and the 

same is true for the adjoint 5(x, D). Hence a continuous map 
a(x, D): FIR") > F(R") 
is defined by (18.3.15) with ue ¥'(IR’,) and ve A(R’). If ue A(R") is extend- 
ed to ueS'(R4) by defining up=u in R% and ug=0 elsewhere, then 
a(x, D)uo is the function 4(x,D)u defined by (18.3.1). By the Hahn-Banach 
theorem the restriction map #'(R".) > /(IR“,) is surjective. The kernel is 
(18.3.18) '(IR"., IR") ={ue¥'(R"); suppucAR".} =|) KR", AR’), 
(18.3.18)' SK(IR",, OR") = {ue F(R"); x*u = O}. 
Here we have used Theorem 2.3.5 and the fact that temperate distributions 
are of finite order. Now 
(xk G(x, D) u,v) =(u, 6(x, D)xfv)=0, ve A(R), ueS%'(R',, OR") 

for b(x,D)x*v vanishes of order k when x,=O by virtue of (18.3.7), so we 
can take out a factor x* which annihilates u when moved to the left. Hence 


(18.3.19) a(x, D) KR", OR.) K'(IR"., GIR") for every k. 


Thus &(x,D) induces a map /(IR4)>/(R%) which is still defined by 
(18.3.15), now with ue F(R’) and ve ¥(R*). Its restriction to CPUR%) 
determines G(x, D) on #’(IR“.) since CP (IR".) is dense there. 


In the standard calculus of pseudo-differential operators one ignores C® 
functions since they are in the range of operators of order —oo. A some- 
what larger class of distributions is neglected in the totally characteristic 
calculus. 


Theorem 18.3.9. If aeS;,~ and ueS'(IR",) then G(x, D)uel*(R", OR") for 
some k, and supp G(x, D)uc R".. 


Proof. By Definition 18.2.6 the statement means that the order of the distri- 
bution .D* (x, D,)*" d(x, D)u has a bound independent of a. Now 
(D* (x, D,) G(x, D) u, v) =(u, b(x, D) D* (D,, x,)"" v) = (u, b,(x, D) v), 
if ve A(R") and 
b,(x, 6) =o" (6, ~i 16, 8/0&,)*" b(x, AeSi,”. 


It suffices to verify this for the operator b(x, D) D, x,=5(x, D)x, D, —ib(x, D). 
Then it follows from (18.3.1) since the Fourier transform of x,D,v is 
—D,6é,0 and 


E, Dz, (2° B(x, A =e x, €,(B(x, 8) —1b™ (x, 6). 
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Hence it follows from Theorem 18.3.5 that for suitable y, y’ independent of « 
|(D* (x, D,)*" G(x, D)u,v)|SCsup |x’ Db, (x, D)o| 


R+ |#+y|Su 


SC,sup >) |x? Do}. 


RE [p+y1su’ 
Here we have used the following lemma: 
Lemma 18.3.10. The quotient topology in S(R",) is defined by the semi-norms 
S(R*)aviresup|x* D? v]. 
Ry 


Proof. That these are continuous semi-norms in (IR",) is obvious for they 
are continuous in S(IR") and constant in each equivalence class. On the 
other hand, let g be a continuous semi-norm on S(IR"). Then q is a 
continuous semi-norm on S(R"), so 
qwv)SC YY sup|x* DF y| 
je+s|sk R” 


and q(v)=0 if v=0 in R".. If ve A(R") we now set 
B(x)=v(x), x,20, dx)=x(x,) Y Go’, Ox</j!, x, <0, 
gmk 


where yeC® and y=1 in (—1,1). Choose ¢eC#(R") with { pdx =1 and set 
o,(x)= o(x/e)e-*. Then d*6,eX% ved,=i*¢, in R®, and v*ed,>v in F 
when e— 0. Hence 
q(v)=lim q(vx¢,)=limg(5*o,) SC’ )) — sup|x*D* o|, 
E> 0 e>0 jat+B}s2k RY 
which completes the proof. (Using the proof of Theorem 2.3.5 we could of 
course replace 2k by k here.) 


Recall that for the distributions in the theorem the wave front set is 
contained in the conormal bundle of the boundary. In many contexts one 
can exclude such singularities for other reasons. However, we postpone the 
discussion of this and related matters until we have completed the remain- 
ing parts of the calculus, the product formula and the invariance under 
change of variables. 


Theorem 18.3.11. If ajeSt, j=1,2, then G,(x,D)a,(x,D)=6(x,D) where 
beST*™ is given by 
(18.3.20) B(x, é) = é Oe; aad a 1(X, n) a, i, Xn Yao g, ae Va)ly= {x’, L)q=€ 

~¥a® (x, €) D®, D® a4(x',5Xq',5E,)/0!l,- 1. 


Proof. Assume first that a,¢S;,% and that a,(x,é)=0 for large |x|. If 
ueS(R") it follows then that @,(x, D)ueé’ has the Fourier transform 


grs(2m)"felF— aly, Qa(Qdydé. 
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When a is of order — oo then (18.3.1) remains valid for reasons of continuity 
when ueé’(IR".) and x,>0, for Ere! a(x, €) is then in Y and is a C® 
function of x with values in “ Hence we obtain as in the proof of 
Theorem 18.1.8 that G,(x, D)@,(x, D)u=6(x, D)u, x,>0, if 
B(x,.)=(2m)~" fet 5—™ G(x, n) Gay, Ody dn, 

that is, replacing €, by ¢,/x,,", by y,/x, and y, by x,y, 
(18.3.20)' B(x, €) 

=(2m)—-" JP ePID En— A) (x, 9) 2 (Y's Xp Ino So En Yn) Ay aN. 


In>O 


We shall now show that (18.3.20)’ defines for arbitrary m,, m, a continuous 

bilinear map Sf? x S7?3(a,,a,)r>beSf +", It will then be easy to show that 

b(x, D) =4,(x, D) G(x, D) without the simplifying assumptions made above. 
Choose yeC?(R.,) as in Lemma 18.3.7. To study (18.3.20)' with a cutoff 


function (y,) inserted in the integral we introduce 


Fy, (Vs) = 44 (%, 1) (V's Vn C5 Sn Vn) XV 0) 


Then (y, gro (1+|é)-"" (1 +x,)"f, (y, 7) is uniformly bounded in S™ for any 
v. In fact, y, lies between fixed positive bounds in the support, a differen- 
tiation with respect to y, bringing out a factor €, is accompanied by a 
decrease in the order of a,, and |€,|/(1+/¢|+1€, y,|}S1/ly,|. Hence it follows 
from Theorem 18.1.7 that 


by(x, é) = Pw Par fF el), Mly- (x', 1). a= & 
is defined and that 


[by (x, é)- Y <iD,, D>) a4 (XN) 4a(V's Vn Xns Oona ya’, tena el | 


J<N 
SCy, (1 +x,)7 "+E ym 


In view of Proposition 18.1.4 it follows that b,e«S™'*"? and that b, has the 
stated asymptotic expansion. 
Set 


A,(x, y)=(2m)~"(L—y(1 —yy)) fe? ay (x, 2) a6. 
Since y(1—y,})=1 in a neighborhood of 0, the proof of (18.3.11) gives 
(8.3.11) |D% DE A,(x, IS Cypn(l tly) MU +x,)7% 


for any N; the constants C,sy are semi-norms of a, in S™. Since A,(x, y) 
=0 when y,>1 by the lacunary condition, we also have by Taylor’s formula 


(18.3.11)” [DDE A(x, NN [yn 1-¥ S Cog (l + Ly) 28 (1 +26,) 7%. 
Now we have b=b, +b, where 
by (x, d= eh “MS 1A “Yen B(x, y, Edy, 
B(x, Ys C)=A p(X, x — y's L—Yq) G2 (V's Xp Vn Ss Sn Vn) 
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If we observe again that a differentiation with respect to y, which brings out 
a factor €, is accompanied by a decrease of the order of a,, we obtain 


|D% D8 Di g(x,y, HS Crgyw(l tlx—yl+x,) XC + Eyre 


in view of (18.3.11)”’ and the fact that 1+/€’1+/é, y,J21y,|(1 + 1€D/A1 +1y, |). 
Since for every B 


EP a(x, E)= J ei MF) “1 Yon _ DP g(x, y, ) dy 


it follows at once that b,¢S7°. 

When a,eS;,° and a,eS7?, a,(x, €)=0 for large |x|, we have now shown 
that a,(x, D)a,(x, D)u=b(x,D)u in R*, ueY, where beS{” is given by 
(18.3.20). Since this is 0 when u=O0 in R”, we have beS;,*. If a, does not 
vanish for large |x| we set a,,(x, €)=7(x/v)a,(x,€) where yeCo> is equal to 1 
in a neighborhood of 0. Then 4,(x, D)4,,(x, D)=b,(x, D) where b,eS;,” and 
b,>b in S{° as v-oo by the proof above, b always being defined by 
(18.3.20). Hence beS7,* and G,(x, D)d,(x, D)=b(x, D). The hypothesis that 
m,=—0o is now removed exactly as in the proof of Theorem 18.3.8 which 
completes the proof. 


The proof of Theorem 18.1.11 is now easily modified to a proof of I? 
estimates for totally characteristic operators. 


Theorem 18.3.12. If aeS, then G(x, D) is continuous in I? (IR",). 


Proof. If aeS;,"~* then the kernel K of G(x, D) satisfies (18.3.14) and the L? 
continuity follows from Lemma 18.1.12. If aeSj;* we have 


\|a(x, D) ull? = (B(x, D) G(x, D)u, u) = (E(x, D)u,u), ueS, 
where 5(x,D) is the adjoint of G(x,D) and ceSj,7*. If &(x,D) is already 


la 
known to be I? continuous we obtain the same result for G(x, D), so the 
theorem follows for all aeS,,* if k< —(n+2)/2, then if k< —(n+2)/4 and so 
on. The proof is then completed by taking an approximate square root 
satisfying the lacunary condition just as in the proof of Theorem 18.1.11. 
(Note that (M?—|a|?)}*—MeS® if aeS® and sup|a|<MeR.) The repetition 
is left as an exercise for the reader. 


Since &(x,D) commutes approximately with differential operators it is 
possible to extend Theorem 18.3.12 to H,,) spaces: 


Theorem 18.3.13. If aeSp, then &(x,D) is continuous in H,(R".) and in 
H (IR) for every seR. 


Proof. For s=0 this is just Theorem 18.3.12. If we S(IR") we have by Theo- 
rem 18.3.5 


D, a(x, D) u=4(x, D) Dju—id,,4(x, D) D,u—iG,)(x, D)u. 
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If s is a non-negative integer for which the theorem is proved we obtain 
if ue A(R") 
||4(x, D) ull. 1)= NG(x, Dy ull +) 1D, a(x, D) ull 
1 


SC(lull +d ID wlE=C lulls v- 
1 


Now S(IR".) is dense in H,,.,,(R".) by Theorem B.2.1 so this proves con- 
tinuity in H, +1(R",). Using (B.2.2) we obtain in the same way the con- 
tinuity in H,,, ,,(R".). By Theorem 18.3.8 the adjoint of G(x, D) in H,,(IR",) is 
for every s the operator b(x,D) in H,_,,(IR".) where beS?,. By duality we 
therefore obtain the continuity for negative integers s also. The proof will be 
completed by an interpolation argument close to the proof of 
Corollary B.1.6, which establishes continuity in H,,+,,(R".) if k is an integer 
and |s| <3. 

Choose we C7 (IR",) with W(é)=1+O0(\é|?) as E0. To do so we can first 
take peC2(R") with $(0)=1 and set W=2¢—+¢, for P=1-(1—6) 
then. Writing w,(x)=e~" w(x/e) we have for |s| <4 


1 1 
Silveullig ye’? de+ filu—weullg_ye7P des C llullo sy 
0 a) 
when ue& This is equivalent to the elementary inequalities 
1 1 
(WedPe-*deSCatle?y’, f[t—PedPe-?- des CA +E?) 
0 0 


Both are obvious if |é|<1, since |1—W(e}?e—3-25 is bounded then, and 
when |é|>1 they follow if ¢ is replaced by ¢é/|é| and the integrals are 
extended to + 00. Set v,=a(x, D)(y,*u), w,=a(x, D)(u—p,*u). Then 


1 
_ 2 —3-2 2 
Felis ye *sde+ IWella— \desc, HUllisen 
0 
in view of the continuity in Hg+1,(R",), that is, 


1 
(20)~"f f V6, (E)?7 (e? + le E174? +1, (OI? (6? +e 7 )e 1726 db de 
0 
SC, |lUlloau- 
If U=G(x, D)u we have U =3,+¥,, hence 
IG()? SC, (16, (EP? (e? + le Ek ** + 1H (OIE? +e E/?)*-*) 
if 1<2e(1+|é{)<2. Since 


e—1-2684+0 de> C,(1+/E|?)°t* 
1<2e(1+ [é)<2 
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we obtain the required estimate |UNe awe allullé au: The proof is com- 
plete. 

The interpolation argument is also applicable to Besov spaces; we con- 
tent ourselves with the case of interest here: 


Theorem 18.3.14. If aeS?, then G(x, D) is continuous in ° A(R ".) for every s. 


Proof. First we show that for ue®H,,+,(IR".) and |s| <4, 0<e<1, 
ly ella ye 8+ lu eullg_ ye 7-7 °SC MMlulla, 9 


This follows if we show with the notation in (B.1.2) that 
Yup WWegy(eR)- +l Weg PeR)7-*) <0, 


The terms where ¢R,<1 can all be estimated by (¢R,)?~** since 
[L—(e 2)? S Cle E|* then, and those with ¢R,>1 can be estimated by 
(eR,)-?-*5 since |y(e el < Chleel? then. The sum of a geometric series with 
ratio <4 is at most twice the largest term which proves the statement. Now 
we obtain with the notation in the proof of Theorem 18.3.13 


esto laa te 7 well S CP lullass 
that is, 
e749 FGA? (2? + le)? + WW COP (e? Hes?) de 
SC” lula») 
If we restrict the integration to X, and take «=1/R, it follows that 
Ref (OOP dot SCPlullisy 


Xj 


which completes the proof. 


We can apply Theorem 18.3.14 to the spaces I* in Definition 18.2.6: 


Corollary 18.3.15. If aeS?, and uel*(IR", dR") 1 (IR“), then 
a(x, D) uel*(IR", OR") F(R"). 


Proof. We must show that P(x, D)a(x,D)ue*H!%,_»4, if PeDifft (R".). 
Choose an even integer M>m-+m’ and set O(€)=|é|“. By Theorem 18.3.11 
we can use the argument in the proof of Theorem 18.1.9 to show that 


P(x, D) G(x, D) = F(x, D) O(x, D) + G(x, D) 


for some F, GeS?,, if the coefficients of P have compact support. Since 
Q(x, D)u and u are in A _%_nja) by hypothesis, the statement follows now 


from Theorem 18.3.14. 
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We shall later on use a dual version of Corollary 18.3.15 to obtain the 
continuity of d(x, D) in another space which occurs in the study of boundary 
problems. (See Proposition 18.3.23.) 

All operators of order —oo do not map He into Hy) for any s’>s. In 
fact, let us consider (G(x, D)u,,v,)={{K(x, y)u,(y)v,(x)dxdy with u,(y) 
=etu(y’', y,/€), v.(x)=e7? v(x’,x,/e) and suitable u, veC%. As e>0 it is 
easily seen that if G(x, D) is continuous from H (s) tO H ¢) and s’>s then the 
function F in Theorem 18.3.6 must vanish when t=0. However, operators of 
positive order cannot consume more differentiability than in the case of 
standard pseudo-differential operators: 


Theorem 18.3.16. If aeSj, and m20, then (x, D) is continuous from H,)(R".) 
to His_m(IR".) and from H,,)(IR".) to Hy. m(IR".) for any seR. 


Proof. The first statement implies the second one by duality. When proving 
it we first assume that m is a positive integer and that the statement has 
already been established for smaller values of m. We can write 


a(x, =) oj a,(x, é) +(x, é) 
1 
where a,eS7,*. In fact, writing 


b,(x,€)=€,a(x, E/A+1E|?), 70; bolx, )= a(x, H/(1 +1); 
we have a(x, £)=)' €,b,(x, £)+b (x, €) and can take 


a;=(bj)p, j#0;  ay=byo+)E(b;-a)=a—)'E,a;, 
1 1 


with the notation in Lemma 18.3.4. Since 


n—1 


a(x, D)u= }° G,(x, D)D,u+x,4,(x, D)D,u+Go(x, D)u 
T 


it follows from the inductive hypothesis that a has the stated continuity 
property. Moreover, (a,u)+d(x,D)u is continuous from Sf, x Hj,(IR".) to 
FAs — mR". ). 

A complex interpolation argument similar to that in the proof of Theo- 
rem 7.1.12 will now prove the general statement. Set 


B(x, )=a(x, YU+/E2)°-™, A, =(B,), +a—a,. 


Then A, is analytic in z, A,=a and A,eSf*?; the semi-norms can be 
estimated by a power of (1+|z]) when Rez is bounded. If M is an integer 
>m it follows from the first part of the proof that when Rez=0 or Rez 


=M we have for some C, and yp 


(A,(x, D)u, (L+|D}?)~*? I SC + 2) lull lol gs = ve ORY), ve F(R"). 
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When 0SRezSM we have a weaker estimate with ||u||,,. replaced by 
\lUll(s+a> for A,eSM and the semi-norms there can be estimated by a power 
of (1+ zl). Since 1+ |z)<|2z+2|, Rez20, it follows from the Phragmén- 
Lindeldf theorem that when 0< Rez<M we have for the same u and v 


(A_(x, D) u, (1 +|D|?)~7? vo] SC,|22 +2!" llullyy ells): 
This means that 
| AL(x, D)ullis_rez) S Cy |2z+2/4 lulls), ue S(R".). 


When z=m this proves the first part of the theorem when m>0, hence the 
second part when m<0Q. The proof is completed by the argument at the 
beginning. 


To discuss invariance under a change of variables we shall resume the 
study of the kernel K of G(x,D) begun with Theorem 18.3.6, assuming only 
that aeSf, now. The inverse Fourier transform 


A(x, y)=(2)~"f ek"? a(x, n)dn 


is then a conormal distributione/"(R?", {y=0}) with principal symbol 
a(x,n)|dx|?|dn|? on the normal bundle {(x,0,0,7)} of the plane y=0. (To be 
quite precise we should observe that a and therefore A is only defined when 
xX, 20. However, this is obviously inessential since x, is just a parameter on 
which A depends in a C® fashion. Quite generally, if X is a manifold with 
boundary and Y a submanifold intersecting 0X transversally, then the defi- 
nition of I”(X, Y) in Section 18.2 can be applied if we just use local coor- 
dinate systems near the boundary such that X is defined by x, 20, say, and 
Y by x,=...=x,=0. We leave this slight extension for the reader.) The 
kernel K of G(x, D) is given by (18.3.12), 


K(x, y)= A(x, x’ —y’, (x, —Y,)/%,)/Xn> x, >9. 


By (18.3.1) K is a continuous function of x with values in #’ when x, 20, 
and K=0 when x,<0. To interpret this we introduce again the symmetric 
singular coordinates (18.3.8). Since D(t,r)/D(x,,y,)= —1/t, we obtain if K is 
transformed as a half density to k(x’, y’,t,r)= K(x, y) t* 


(18.3.21) k(x’, y',t,r)= A(x’, (1 +7/2), x’ —y’, 2r/(2+r))(14+r/2)-' 73. 


Thus t?k(x’,y,t,r) is a C® function when 0<|r|<2, t20. All derivatives 
tend to 0 when |r|—2. In fact, this follows when r— —2 from the fact that 
(18.3.11) is valid when |y,| is bounded away from 0. When r—2 the state- 
ment also follows from (18.3.11) and the fact that A(x, y)=0 when y,>1 by 
the lacunary condition. Defining k=0 when |r| 22 we obtain ttkeI™(IR2", A) 
where IR2” is defined by t=0 and 


4={(x',y,t,n; x =y',r=0,t20}, 
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which corresponds to the diagonal in the original coordinates. With the 
corresponding parametrization of the conormal bundle of 4 as {(x’,x’,t,0,&, 
— €’,0, p)} the principal symbol of t? k is 


(18.3.22) a(x’, t, &, p)|dx'|* |de|* |dé"|? |dpl?. 


In fact, 2(1+1r/2)?k is the pullback of A as a half density by the diffeomor- 
phism (x’, y’,t,r)ho(x’,t(1+r/2),x’—y’,2r/(2+r)). When r=0 the differential 
is (dx',dy’,dt,dr)t+o(dx’,dt+tdr/2,dx'—dy’,dr); the adjoint maps the ele- 
ment (x, 0,0, €) in the normal bundle of IR” x {0} to (x’,x’,x,,0; &, —€’,0,€,). 
When ueC> the substitution y,—sx, gives, with all integrals representing 
the action of a distribution 


J K(x, y) u(y) dy =fJ KO’, yx, (1 +3)/2, 21 —s)/(s + 1)) (A +)/2)-# 
-u(y’, 5x,) x2 dy’ ds. 
If ky is the limit of t*k as t0 then this converges to 


JJ ko, y' 20 —s)(s + 1) (21 +5)) u(y’, 0) dy’ ds 
when x, 0. 
Conversely, given a distribution t*keJ”(IR3", 4) of compact support, 
vanishing when |r|>2, we can reverse the argument by introducing for 
x,20 


(18.3.21Y — A(x, y= K(X', x’ —y’, x,(2—Yq)/2, 2 Yy/(2—y,)) x2 (1 — y,/2) 


which vanishes for y,>1, decreases rapidly as yoo, and is conormal 
with respect to the plane y=0. From A we return to aeS™, by 


a(x,n)=J A(x, y)e7* dy. 


Hence we have an identification of the operators G(x,D), aeS”, and the 
conormal distributions in /”"(IR2", 4), at least when the kernels have compact 
supports. Thus we recover locally the invariance under passage to the adjoint 
(Theorem 18.3.8). Invariance under coordinate transformations will follow 
when we have discussed the intrinsic meaning of the new coordinates 


(18.3.8). 


If X is a C® manifold and Y is a C” submanifold, then a new manifold 
X, the blowup of X along Y, can be defined as the union of XX Y and the 
projective normal bundle of Y, that is, the quotient by multiplication with 
real numbers +0 of the normal bundle (T(X)|y~ T(Y))/T(Y). If f and g are 
C® functions in X vanishing on Y then f/g is well defined in 


{xeX, g(x) +0} U {teX ~ X, <f,dg> +0}, 


as a limit in the second set. We declare such sets to be open and f/g to be 
in C® there. Together with the C” functions in X lifted to X these 
quotients define a C® structure on X. To see this we choose local coor- 
dinates x=(x,,...,X,) in X such that Y is defined by x’=(x,,...,x,)=0. 
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Then X can be identified with the quotient of IR x S*~! x IR"~* by identifi- 
cation of (t,@, x”) with 


i(t, a, x” )=(—t, —@, x"). 


Note that the projection 7: (t,w,x”) + (tw,x”) € X is defined on X since 
mi=mn. If f,g € C™(X) and both vanish on Y then 


(f/8)(0, 0, x")=<f.(0, x"), o>/<8-(0, x"), o> 


when the denominator is not 0 and then we have g(t w,x’)+0 for small |z1. 
Thus f/g is defined and C® in an open subset of IR x S*—! x JR"—* and even 
under the involution i. Where w, +0, for example, all such functions are C® 
functions of w,/o,(j<k), x,=tw, and x,,1,...,x, so we have indeed a C® 
structure. We could have taken these polar coordinates as the definition of 
the blowup of course, but then we would have been obliged to prove that 
the procedure is coordinate free. 

If X and Y are two manifolds with boundary, we can blow up the 
manifold with corner X x Y with respect to the corner manifold @X x OY, of 
codimension 2. Without embedding X and Y in open manifolds we then 
obtain the stretched product X x Y which is the union of X x YX (dX x dY) 
and the projective interior normal bundle of 0X x dY in X x Y defined as 


(Tru(X lox + Tint (Y lov) ~ T (OX x BY))/T (OX x BY) 


modulo multiplication by positive reals. Choose local coordinates in X and 
in Y such that X is defined by x,20 and Y by y,,20. Then X x Y has local 
coordinates (x’, y’,t,r); t20, —2<r<2; with the map into X x Y given by 


(yt. r) rH @’,td+r/2),y',t — 7/2). 


Thus the C® structure defined by these coordinates is coordinate free and 
agrees with our earlier singular coordinates (see Fig. 2). 


In the particular case where X=Y the closure in X x X of the diagonal 
in (X\@X)x(X~\6@X) is a C® manifold 4c X x X which does not meet the 
corners of X x X, and it intersects the boundary transversally. In fact, with 
our local coordinates it is defined by x’=y' and r=0. The restriction to 4 of 
the C®? map Xx X > XxX is a diffeomorphism on the diagonal A in X 
x X. Now recall that the normal and conormal bundles of 4 in X x X are 
naturally isomorphic to T(X) and T*(X) lifted to A by the projection on 
one of the factors. More precisely, a cotangent vector yeT,*(X) corresponds 
to nt y—n3 yEN,, .(4) where x, and x, are the two projections, and if v is 
a tangent vector to XxX at (x9,x,) then m,,v—72,v is a tangent 
vectoreT,,(X) depending only on the class of v modulo T,, ,,(4). Now the 
C” map X x X +X x X defines maps 


(18.3.23) T(X XX)>T(XxX), THX xX) 4 T*(X X X)I7. 


0» XO 


130 XVIII. Pseudo-Differential Operators 


Let T(X) and T*(X) be the dual vector bundles obtained by pulling the 
normal and conormal bundles of 4 in X x X back to X by the inverse of 
the projection 4-+ X, which is a diffeomorphism since it is a product of two 
diffeomorphisms 4 A —!, X. Then we have natural maps 


(18.3.24) T(X)>T(X), T*(X) > T*(X). 
The first is the composition 
T(X)—> T(X x X)|y/T(A) > T(X x X\/T(A) > T(X) 


where the maps in turn are obtained from the definition of T, the first part 
of (18.3.23) and the discussion of the normal bundle of 4 above. The second 
map is similarly the composition 


T*(X)—> N(4) > N(A) > T*(X) 


where the second map comes from the second part of (18.3.23). With our 
local coordinates the cotangent vector ¢é,dx> at xeX corresponds to <é,dx 
—dy> at (x,x)EX xX and is mapped to <é',dx’—dy'>+té,dr. Thus we 
have in local coordinates the map 


T*(X)a(x, E)H(x’, x’,.x,,0, &, —&,0, x, €,)€N(A) & T*(X). 


Every element in the normal bundle of 4 at (x’,x’,x,,0) has a unique 
representative of the form 


n-1 


» v,0/8x; +0, 0/dr. 


By duality we see that (18.3.24) maps it to 


n-1t 


ys v;0/0X;+X_0,0/0x,€T(X). 


In view of Lemma 18.2.5 it follows that the map T(X)—T(X) sends the 
sections of T(X) to the vector fields which are tangent to 0X. This closes a 
circle; it was on these vector fields that we set out to model an algebra of 
pseudo-differential operators in this section. 

We sum up the preceding conventions and results in the following 


Definition 18.3.17. Let X be a C® manifold with boundary. 

a) The stretched product X x X is a C® manifold with corner obtained 
by replacing 0X x dX in X xX by the interior projective normal bundle. 
There is a natural C” map Xx X> XxX. 

b) The diagonal A in X x X is the diffeomorphic image of a manifold 4 
in XxX which only intersects the new smooth boundary and does so 
transversally. 

c) The compressed cotangent bundle T*(X) is the pullback of the conor- 
mal bundle of 4 in X x X by the inverse of the diffeomorphic projection on 
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X. There is a natural map T*(X)-»T*(X) which maps T*(X) linearly to 
T*(X), bijectively if xeX \ AX and with kernel N,(6X) and a hyperplane as 
range if xedX. The range can then be identified with T,*(X)/N,(0X) = T.*(0X) 
which makes T*(@X) a subbundle of T*(X)|,,. The sections of the dual 
bundle 7(X) are mapped in T(X) to the vector fields tangent to @X. 


The pullback of the symplectic form in T*(X) (see (6.4.8)) to T*(X), thus 
to the conormal bundle of 4 in X x X, is a symplectic form with singularity 
at 6X. In the local coordinates above it is given by 


n-1 


(18.3.25) ¥, dé, Adx,+t-' dp adt. 
1 


Thus the half density |dé’|* |dx’|?¢~*|dp|? |dt|? is invariantly defined. The 
principal symbol of k, which is t~* times (18.3.22), can therefore invariantly 
be identified with the function a on N(A) 0, that is, on the compressed 
cotangent bundle T*(X)\0. 

We have now developed all that is needed for a global calculus: 


Definition 18.3.18. If X is a C®” manifold with boundary then the space 
wr" (X;Q*,Q*) of totally characteristic pseudo-differential operators A of 
‘order m on half densities in X is the set of all continuous linear maps 
C2(X, Q*) > C°(X, Q*) with Schwartz kernel K obtained by pushforward 
from the stretched product X x X to X x X of a distribution half density k 
such that ktteI™(XxX,A) and k vanishes of infinite order on 
a(X x X)\ (0X x 6X). Here t is a C® function in X x X which is positive in 
X x X\(0X x 0X) and vanishes simply on the other part of the boundary. 


The pushforward is defined as follows: If f: X x X > X x X is the natural 
C® map and @¢ is a half density of compact support in X x X, then f* @ is a 
half density of compact support in X x X so the equation 


(18.3.26) K()=k(f* 9) 


defines a distribution half density K in X x X. 
Locally A can be defined by (18.3.1). As we have just seen this leads to a 
principal symbol isomorphism 


(18.3.27) "(X24 O/H (x; OF, O4) = S"(T*(X)/S"- (T#(X)) 


where T*(X) is the compressed cotangent bundle on X. The adjoint of an 
operator in ¥%" is in ¥", and the principal symbol is obtained by complex 
conjugation. If Ae ¥%" and Be ¥" are properly supported then ABe¥"*™ is 
properly supported and the principal symbol of AB is the product of those 
of A and of B; the isomorphism (18.3.27) also holds if one takes only 
properly supported operators in the left-hand side. All these basic facts of 
the calculus as well as extensions where vector bundles are present follow 
just as in Section 18.1. One just has to replace say Theorems 18.1.6, 18.1.7, 
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18.1.8, 18.1.17 by Theorems 18.3.5, 18.3.8, 18.3.11 and the connection with 
conormal distributions on X x X established above. We have also proved 
that all operators in ¥" map H(\"?(X) continuously into H eee. ) if m20. 
The spaces He™?(X), 9'(X), &'(X), CS(X),... are all defined without any 
reference to an extension of X to an open manifold. So is the subspace 


(18.3.28) Af (X)=I"(X, 0X)< D(X) 


of distributions u remaining in LO oe ise n=dim X, after any totally 
characteristic differential operator has been applied to u. (We tacitly allow 
values in a vector bundle but do not wish to burden the notation by making 
this explicit when it is not essential.) By Theorem 18.3.9 every operator in 
Y-° maps &'(X) into W(X ea sf (X), and by Corollary 18.3.15 /™ (X) 


comp 


is mapped into ./”(X) by any "operator in ¥° =|) YF. In these respects the 
k 


relation of (X) to totally characteristic operators is similar to the relation 
of C*(X) to pseudo-differential operators when X is an open manifold. It is 
therefore natural to expect that the dual space of will be a useful space 
of distributions, and we shall introduce it after a preparatory lemma. 


Lemma 18.3.19. There exist linear smoothing operators Q,: 9'(X)> C®(X) 
such that Q,u>u in oS”, 60, whenever ues™ for some m'<m. For 
every compact set KcX there is another compact set K'<X such that 
suppQ,ucK’ if suppucK and 0<e<l. 


Proof. By a partition of unity the proof is reduced to the case X=R",. 
Choose yeCp(R".) with 7(0)=1 and set Q,u=y,*u, 7,(€)=X(e6). Then the 
statement on the supports is obvious. Taking y(x)=¢(x')W(x,), x’ 
=(X,,..-,X,_1), we obtain if 


u=fa(x',é,)emndeé, 
that 
Q,u=fa,(x’,E)e dé; a(x, E,)=Wee) fa(x’-ey', En) oy) dy’. 


If acS" and p>’ it follows in view of Proposition 18.1.2 that a,>a in S“ 
when e-0. 


Definition 18.3.20. By .’(X) we shall denote the set of all we9’(X) such that 
for every compact set KcX and every m2—(n+2)/4 the form 
' CP(K)3¢tu(¢) is continuous in the topology of , defined in (18.3.28). 


We recall from the discussion after Definition 18.2.13 that C2(X)-w™ 
if m= —(n+2)/4 which is the reason for this condition in the definition. 
Since the embedding #") > g™ is continuous when m’ <m the continuity 
condition becomes stronger when m increases, and ¢ ++ u(@) is continuous in 
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the &” topology for every m if Pe C2(KAX°) where X°=X~6X is the 
interior of the manifold. By Lemma 18.3.19 the restriction of u(p) to CP(X°) 
already determines a continuous extension to #” \é uniquely. In particu- 
lar, the restriction 

t'(X)aurou| y-€D(X°) 


is injective. It is really the range of this map which is dual to « which must 
be kept in mind when arguing by duality. 

As topology in .’ we shall use the weak topology defined by the semi- 
norms u ++ |u(¢)| when ¢ € 60 &'. Now C™(X) is a subset of .@’ since the 
topology in #” is stronger than that in- @ Hie jay and Ce(X) is weakly 
dense for if ¢¢ea¢ N€& then u()=0 for all ue C2(X) implies d =0. 


Proposition 18.3.21. There is a unique continuous restriction map 
f'(X)— YX) which agrees with the standard restriction on C®(X). 


Proof. To underline the invariance we consider distributions with values in a 
vector bundle E. If ue C?(X, E) then 


(18.3.29) Ulex, @>=<u,TO>, PECo(OX, E'@ QEX)) 


where 2 is the density bundle and Té=¢@ 6(x,) if x, as usual is a local 
coordinate vanishing on 0X. This is independent of the choice of local 
coordinates since 6(x,) is a distribution density on R. Now the map 


C2(0X, E' @Q(0X))adroThe Lf ?-/*(X, FE’ @A(X)) 


is continuous so (18.3.29) defines a weakly continuous map u ++ ulay. Since 
the uniqueness is obvious this completes the proof. 


Differentiation is continuous from #™” to "+, This implies that the 
space of restrictions to X° of elements in /’ is invariant for differentiation, 
so using Proposition 18.3.21 we can define boundary values of arbitrary 
derivatives of u. (In view of Corollary 8.2.7 this means that the elements of 
sf' have a certain regularity at N(0X) although &’(X°) is a subset of '(X).) 
However, differentiation does not preserve the space ./’ unless one takes 
boundary terms into account as we already did in Theorem3.1.9. To sim- 
plify the statement we take X=R“,: 


Proposition 18.3.22. If ue.f'(IR".) then D; Uti Sin ulx,-0 @6(x,)e.0'(IR", ); the 
restriction to IR", is equal to D, applied to the restriction of u. 


Proof. D;,u is defined by regarding u as an element in 9'(R“.), thus 
<Dju,o>=—<u,D;>, ¢eCF(R4). 


If oy is the element in of defined by ¢, that is, 6,=¢ when x,20 and ¢, 
=0 when x, <0, then 


D;oo=(D; P)o —16 jn P(-,0) @ 5(x,). 
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The inclusion C?(R".)<d™ identifies D;@ with (D; d)o. Thus 
<D; u, > = — <u, D; o> —i 6 j,<U, p(. 3 0)@ 6(x,)>- 


Since 
<u, Pl. ? 0) ® 5(X,)> = Cals 2 O° pl. ’ 0)> = Cul, = 0 ® 5(X,), @>, 
and 
A™ AE (K)ayrtu, Di 
is continuous for every m and every compact set K, the proposition is 
proved. 


Proposition 18.3.23. If Be¥"(X) is properly supported and ueé.sf'(X) then 
Buesd'(X). 


Proof. Bue9'(X) and if ¢¢C2(K), KE X, then 
(Bu, 6) =(u, B* @). 


Here B* 6¢Co(X) has support in a fixed compact set, and the continuity in 
-the ™ topology follows since 
EK) A™ 2 6 Bt EL™ 


is continuous by Corollary 18.3.15. 


Note that (18.3.7) can now be extended by continuity to vE.o/’. 

The elements of ./(X) are smooth in the interior of X and have tangen- 
tial smoothness at the boundary while those in .o/’(X) have normal deriva- 
tives on OX of all orders. This suggests the following 


Proposition 18.3.24. On any C® manifold X with boundary one has 

w'(X) nf (X)= C°(X). 
Proof. It is obvious that C*(X) co (X) 0 A(X). To prove the opposite 
inclusion it suffices to show that if ued (RY)AW(RY)OAS(R4) then 
ueCP(IR".). Since ues we have ueC%”(IR*%). From Proposition 18.3.22 it 
follows that for every « there is an element u,¢.v’ such that D*u—u, has 


support in dR", . ; ; 
Choose m so that ue./™. Then v=x% Dues ™t*—), that is, 


DE(x,D, PvE He mn —njaycL? for all B, 


where the last inclusion holds if N>a,+m-+n/4. Let yeCPdR), OSyS1, 
and y=1 in a neighborhood of 0. Set 7°(x)=x(x,/e). Then 


DE (x, D,)'" xv is bounded in I? <”H,o) for all B, 0<eS1. 


In fact, (x, D,,¥°]=%4 if x,Q=tD,x(O so we can commute x’ through to the 
left, obtaining a sum of similar terms with a factor yj to the left instead, 7,(¢) 
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=(tD,) x(t) and the L? estimate is obvious for them. Thus (1—y‘)v is 
bounded in o&-"* for 0<eS1. Since u,e.' and D*u=u, when x,>0, it 
follows that 
(D* u,(1 —x°) v) = J [D* ul? x¥(1 —x°(x,)) dx 
is bounded when ¢->0, hence 
{ |D*ul?x¥ dx<oo if N>a,+m-+n/4. 
Xn>O 


Now we recall Hardy’s inequality for functions veC*(IR_,) vanishing far 
away, 


flo)? e74#des4(2u41)? [oP t24*7 dt, 20. 
0 ce) 


In the proof one can assume that v is smooth at 0 and integrate by parts, 
which gives 


lv t* |]? = —2(2n41) f Revo’ t?4*! dt<2/(2ut1) Il" vo} eer! v']. 
‘ 0 


Hardy’s inequality follows after cancellation of a factor. Now we obtain 


J |Dtul?dxs4 J |D*DYul?x2%dx<oo 
Xn>0O Xn>0 
if v is a positive integer with 2v>a,+v+m-+n/4, that is, v>a,t+m+n/4. In 
view of Theorem B.2.8 it follows that u= U,+U, where U,eC?(IR“), consid- 
ered as an element of /’, and x,=0 in supp U,. Thus U,e./’ so U,=0 in 
view of Lemma 18.3.19, which completes the proof. 


For the wave front set of a distribution in a manifold with boundary we 
can now give a definition parallel to the characterization in Theo- 
rem 18.1.27: 


Definition 18.3.25. If ue D’(X) then WF,(u)c T*(X) is defined by 
(18.3.30) WF,(u)=() Char B 
with intersection over all properly supported Be ¥°(X) such that Bue.(X). 


Here Char Bc T*(X)\0 is the set of all (x,é)¢T*(X)\O such that the 
principal symbol 5b of B is not invertible at (x, €) (cf. Definition 18.1.25). We 
have used the space (X) and not C°(X) because it contains the residual 
terms in the calculus by Theorem 18.3.9. Thus it follows at once from the 
definition and Theorem 18.3.9 that WF,(Au)CI if the symbol of: A is of 
order —0o outside the closed conic set ! < T*(X)\0. However, C*(X) may 
be used if we assume more about u: 


Proposition 18.3.26. If ue.d'(X) then (18.3.30) is valid with the intersection 
taken over all properly supported Be '¥,0(X) such that BueC®(X). 
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Proof. If ue.d'(X) and Bue(X) as in Definition 18.3.25 then BueC”(X) by 
Proposition 18.3.24, for Bue.d'(X) by Proposition 18.3.23. 


Theorem 18.3.27. If X is a C® manifold with boundary 0X and interior X°, 
and ueQ'(X) then 

(i) WE,(Wly. = WF(ul,-). 

(ii) WE,(Bu) < WF,(u) if Be'¥%"(X) is properly supported or if B is a differ- 
ential operator with C® coefficients. 

(iii) If WF,(u)=0 then ue (X), hence ueC”(X) if uect'(X). 

(iv) WF(uljy) < WE,(u) A T* (0X), uesl’(X). 


Note that T*(X)|x- is identified with T*(X °) and that the map T*(X)|5x 
+ T*(X)|ox defines an embedding T*(0X)— T*(X)lox. 


Proof of Theorem 18.3.27. (i) follows since every Be'¥°(X°) with kernel of 
compact support in X° x X° is also in ¥,°(X) and vice versa. To prove (ii) 
assume that yeT*(X)\O\ WK(u), and choose B,¢¥%°(X) properly sup- 
ported and non-characteristic at y so that B,ues/. We can then find 
C,¢'%° properly supported. so that C,B,=1+R, where the symbol of R, 
(in local coordinates) is of order — oo in a conic neighborhood of y. Thus 


CBu=CBC, B,u—CBR,uea 


by Corollary 18.3.15 and Theorem 18.3.9 if the symbol of C is of order —oo 
outside a sufficiently small conic neighborhood of y. Hence yéWF,(Bu). It 
remains to prove (ii) when X =IR",, ueQ'(R",) and B=D,. If a(x,D)ueaw 
and a is non-characteristic at (0,€,) then we can choose ne &) non-charac- 
teristic at (0, &) so that b(x, £)=b(0, 6) when x is in a neighborhood U of 0, 
and the support of b is in a cone where a is non-characteristic. Then we 
have b(x, D)= (x, D) G(x, D)+#(x, D) where ceS° and reS;,”, so b(x, D)ued 
by Corollary 18.3.15 and Theorem 18.3.9. Choose yeCo(U) with x(0)+0. 
Then we have by (18.3.6) 


7.b(x, D) Dju= x D,b(x, D)u+7id;,6(x, D) D,u. 


Since yb and y(b—ib™) are non-characteristic at (0,é,) the statement (ii) 
follows. To prove (iii) we choose for @eC?(X) properly supported 
Be W(X), j=1,...,N, with B,uew and (\CharB, empty over supp ¢. 
Then we can choose C,,. Cy? (X) properly supported so that 


C,B,+...+CyBy=@+R 


where Re¥,-” and @ denotes multiplication by ¢. Since C,B, ue and 
Ruew, by Corollary 18.3.15 and Theorem 18.3.9, it follows that pued. The 
last statement in (iii) follows from Proposition 18.3.24. To prove (iv), finally, 
we assume that yeT*(0X)~0 is not in WF,(u). Then we can find Be '¥°(X) 
properly supported with yéChar B and BueC™(X). We have 


Buloy=Bo(ulax) 
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where B,e¥°(0X); the principal symbol is equal to the restriction to 
T*(0X) of that of B by (18.3.7). Hence By is non-characteristic at y, so 
y¢WF(ul,y). The proof is complete. 


From (18.3.30) it follows at once that 
(18.3.31) WE,(u)cWF(Bu)UCharB, ue'(X), 


if BeW"(X) is properly supported. Assume now instead that P is any 
differential operator with coefficients in C*(X), of order m, and let ueQ’(X), 
Pu=f. If ¢ is a C® function vanishing simply on 0X, then 6” PeDifff and 
bo” Pu=" f. With the usual local coordinates at the boundary we can take 
=x, and have 
x™ P(x,D)= Y x™a,(x) Dtu=¥ x™—™ a, (x) x2" D* u 
ja] Sm 

which shows that the principal symbol when x,=0 is a,(x)é", where « 
=(0,...,0,m). Thus it is identically 0 in some fibers unless 0X is non- 
characteristic, and then it vanishes precisely when ¢,=0, that is, in T*(0X). 
Thus we have 


Proposition 18.3.28. Let P be a differential operator of order m with coef- 
ficients in C®(X), let 6EC*(X) vanish simply on 0X and assume that 0X is 
non-characteristic for P. Then 


(18.3.32) WE, (u)lox ¢ WE,(@" Pu)gy T*(OX), ue D(X). 


We shall now prove a result which is closely related to Theorem 4.4.8’. 
As in that statement we just assume partial hypoellipticity at the boundary 
and not that the boundary is non-characteristic. Let X, be an open set in 
R"~! and set X=X,x[0,c)CR", considered as a manifold with boundary © 
0X =X, x {0} and interior X° =X, x (0, c). 


Proposition 18.3.29. Let ueQ'(X°) satisfy a differential equation of the form 
Pu=D™ut+a,_,D™-'ut...+a,u=f in X° 


where a, is a differential operator in x'=(x,,...,X,_4) with coefficients in 
C”(X), and fesd'(X). Then there is a unique Uesd'(X) with restriction u to 
X°. We have x™(PU —f)=0 in Q(X), and there is no other UeQ'(X) with 
this property. Here u may have values in C%, the coefficients of a; being N 
x N matrices then. 


Proof. It suffices to prove the proposition when m=1. In fact, if u,=Di u, 
0<j<m, then 


Dy Um— 1+ » ajuj;=f; D,Uj=Ujyts j<m-l. 


jsgm-1 
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The statement with m=1 will then give extensions Uje.o’ satisfying these 
equations multiplied by x,. In particular, x,(D, U;— U, 41)=0 for j<m-—1, 
hence 


xy U;=x4,.D, Uj_1 =... =x, Dy" U, = xq Dy Up, j<m, 


for if k<j then x} Di can be rewritten as a sum with a factor x, to the right. 
Thus x”(PU, —f)=0. If Ve9'(X), supp Vc éX and x" PV=0 then V=O. In 
fact, if YE X we can by Theorem 2.3.5 write in Y 


BL 
V=)>v,@6(x,). 
0 
Now 
Xn DEO? (X) = Cuyj OF E—™ 


which should be read as 0 if m>k+/j; here c 
x" PV =0 gives v,=90, hence V,-1=0 and so on. 

From now on we assume m=1. Let K,EX, and set K=K, x [0,c/2]. 
By Lemma 18.3.19 it suffices to show that ¢ +> (u,@) is continuous in the 
topology of & for every k if p¢C%(K). Recall that the semi-norms in this 
topology are 


(18.3.33) pro ® |lxin D* bln _njay: 


By hypothesis |u(@)| < C ||@]|,,. for some s so the continuity is clear when k< 
—s-—n/4. We must prove that continuity in ./ follows from continuity in 
xl") To do so we set W=id+(5@ H(x,)), that is, 


-+0. Thus the equation 


mmj 


Wix)=i f dlendt. 


Then D, w=. Let yeC{(—c,c) be equal to 1 in (—c/2,c/2). With 
An(X) = Z(x,) we have for GE Co (K) 


<u, > =U, %,D,W> = — CDi 4s Xn > — <u, yD, X> 
> Ch kn P> + U, Ag (KY) -V Di Xn? 


Writing H(x,)=ho(x,)+h,(x,) where h,eC® and the support of hy is close 
to 0, we can estimate y in C”(X°) by the semi-norms (18.3.33) on ¢. We 
can also estimate 


zs |D; xn" D*(X, W)ll(ne—njay 


in this way for arbitrary j and «. It suffices to do so for j=n for this implies 
an estimate of *||xi-D%(x,W)ll(_~—nya)- Since x,=1 near 0 it suffices to 
estimate 


WXn Dy Xn" D® WI nay: 
Now 
D,, xt" Diy = xi Di h —ia, xir-! Dén-3 
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where the last term should be dropped when «,=0, so this estimate is 
obvious. Thus the semi-norms ®||x% D*(x, W)ll¢, —¢—n/a) Of XW in o*-” can 
be estimated in terms of the semi-norms (18.3.33) of ¢, which completes the 
proof. 


In the non-characteristic case it follows from (18.3.32) that WF,(u)lsx 
c T*(0X) if feC”(X) We introduce a notation for such distributions: 


Definition 18.3.30. If X is a C® manifold with boundary 0X then W(X) is 
the set of all ue.of'(X) with WF,(u)|,, < T*(0X). 


From Propositions 18.3.28 and 18.3.29 we obtain 


Corollary 18.3.31. Let P be a differential operator with C® coefficients in X 
such that 0X is non-characteristic. If fe W(X) and ueQ'(X°) satisfies the 
equation Pu=f in the interior X° of X, then u has a unique extension 
UE N(X). 


In Chapters XX and XXIV we shall study the wave front set when u 
also satisfies boundary conditions. It will then be technically simpler - at 
least in the present state of the art - not to use the nice invariant definitions 
of this section but rather work with pseudo-differential operators along the 
boundary. We have seen in Theorem 18.1.36 how they act on the wave front 
set in the interior and shall now study the behavior at the boundary. 


Theorem 18.3.32. Let X be an open subset of R", and set 0X =X NOR". If 
ueN(X) and beS™(X xIR"~*) defines a properly supported operator b(x, D') 
in X, then b(x, D')ue V(X) and 


(18.3.34) WE,(b(x, D') u)lay < WE, (Ulax OD 


if I is a closed cone <€X xIR"~} such that b is of order —co in a conic 
neighborhood of (@X x IR"~')\T in X x IR"~!. On the other hand, 


(18.3.35) WE, (Wlay c WE, (B(x, D’) u)| 4x U Char by 
where bo{(x’, €') = b(x’, 0, €°). 


Proof. Since WF,(u) is closed in T*(X)\O and the restriction to 0X is a 
subset of T*(GX), the set 


Y={xeX; x,=0 or 3x,|E,|<|¢'| when (x, 2)eWF(u)} 


is an open neighborhood of @X in X. If weCg(Y) we have uy 
=WwueN(X)né(X), and the wave front set of Yu in X~OX has no 
element (x,&) with é’=0 causing trouble in Theorem 18.1.36. Choose teS® 
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so that 
(18.3.36) ¢t(x,6)=1 when 2/6)<|é,/@]>1, x,<1; 
r(x,€)=0 when [é,|>1é'L 
We can take t(x, ) independent of the x’ variables. If v=t p(X, D) uy we have 
WF, (uy —v) = WF, ((1 —1, (x, D)) uy.) =0 


in view of Theorem 18.3.27(i) and the first part of (18.3.36). Hence 
Uy —ve a’ AA= C(IR"_); it follows from Theorem 18.3.5 that the difference 
is in Y(IR",). If beS*"(R" x R"~') then 


(18.3.37) b(x, D') t,(x, D) uo = a(x, D) uo 


where a(x, ¢)=b(x, ¢’)t,(x, eST, by the second part of (18.3.36). In fact, if 
wef then the Fourier transform of t,(x, D)w with respect to x’ is 


(22) * fee, (x,, 6) 0(E) dE, 


so (18.3.37) is valid with u, replaced by w. Letting wu, in # we obtain 
(18.3.37). Since —f, (x, D) up —upe ZR") we obtain (x, D')ug 
~— (x, D)upeS(IR".), hence b(x, D’)up¢ V(X) and 


WE,(b(x, D') Ulex WE, (U) OL 


since a is of order —oo in a conic neighborhood of {(x,é); x,=&,=0, 
(x’, €)€L}. This proves (18.3.34). 

Let (y’,n')ET*(0X)~\0 be a point not belonging to the right-hand side of 
(18.3.35). Then we can choose weC?(Y) equal to 1 in such a large subset 
that b(x, D’)(ug —u) vanishes in a neighborhood of (y’,0) if up =u as above. 
Then (y’,n)¢ WF,(b(x, D’)u.)= WF,(d(x, D) uy), and a is non-characteristic at 
(y’,0,7',0) since by is non-characteristic at (y’,n’). Hence (y’,n)€WF,(u,) as 
claimed. 


Remark. In the open subset of X where b(x,D')u is determined by the 
restriction of u to {xeEX°; (x,ODéWF(u) when &’=0} we conclude using 
Theorem 18.1.36 that WF,(b(x, D')u)< WE,(u). However, the set where this 
holds depends on u. 


Corollary 18.3.33. If ueN(X) then (y',n)EWF(u) if and only if 
b(x, D‘)ueC”(X) for some properly supported tangential pseudo-differential 
operator b(x, D') which is non-characteristic at (y',0,7’). 


Proof. In view of Proposition 18.3.24 this is an immediate consequence of 
Theorem 18.3.32. 


Our definition of WF,(u) is obviously coordinate free. Hence this is also 
true for the alternative definition given by Corollary 18.3.33 which by Corol- 
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lary 18.3.31 is applicable to solutions of differential equations with reason- 
ably regular right-hand sides. The invariance is all that we shall actually use 
of the results of this section, but the general philosophy will also be helpful 
in discussions of regularity at a boundary. 


18.4. Gauss Transforms Revisited 


The reader will have noticed that the calculus in Sections 18.1 and 18.3 was 
mainly based on the results in Section7.6 on the Gauss transform 
exp(i<D,, D,>) in R2". Indeed, the proof of Theorem 18.1.7 was based on 
Theorem 7.6.5 and a localization argument; the multiplicative properties 
were then reduced to Theorem18.1.7. In this section we shall make a 
systematic study of the localization properties of exp(iA(D)) when A is a real 
quadratic form. The results will be used in Section18.5 to extend the 
calculus of Section 18.1 to more general symbols and to develop an alter- 
native to it, the Weyl calculus, which in many respects has better properties. 

To motivate the definitions we first recall that by Definition 18.1.1 the 
symbol class S” is the set of all C® functions a in IR" such that 


a (x, USC p(L+1EN"—!; x, EER" 


To reinterpret this condition we introduce at (x,é) an orthonormal basis 
with respect to the metric 


(18.4.1) |dx|? +|dE|?/(1 + E|?). 


Then the derivatives of order k with respect to the new coordinates can be 
estimated by C,(1+||)" for some C, independent of €. Our generalization 
consists in considering in a finite dimensional vector space V any slowly 
varying metric in the sense of Section 1.4. By Lemma 1.4.3 it is no restriction 
to assume that it is Riemannian, that is, that for every xeV we have a 
positive definite quadratic form g,(y) in yeV. 


Definition 18.4.1. g is said to be slowly varying if there are positive constants 
c and C such that 


(18.4.2) 8.(y) Sc 84, (0S C8, (0. 


This is precisely the condition in Definition 1.4.7 for the metric |y]|, 
=(g,(y)/c)?. Decreasing c if necessary we may therefore as observed in 
Section 1.4 give (18.4.2) a symmetric form: 


(18.4.2) 8.(y) Sc => g,(0/C Sg, , (0S Cg(. 
An example is the metric (18.4.1) or more generally 
(18.4.1) ldx|?(1 +187)? +1dE7721 +167)? 
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if p $1. Indeed, if g, -(y,4)Sc then In|? Sc(1+|€|?) which implies 
(1+ e)/2s1+]¢+n) S20 +1¢)) 


if c<}. The slow variation follows at once. 
If G is a fixed quadratic form and ueC* in a neighborhood of xeV we 
’ shall norm the k'® differential at x by 


k 
laf) =sup ll e515. | 160). 
tye 1 


For fixed k an equivalent norm is of course the maximum of the derivatives 
of order k with respect to a G orthonormal coordinate system. Leibniz’ rule 
gives 


(18.4.3) luolf()sy (;) ul? (x) [olf (x). 
0 


If u(x)=1 and we put u=1—», then the k'" differential at x of 1/u is equal 
to the k' differential of }’ v/ which can be estimated by a sum of products 


ik 
,, with j,21 and }\j,=k. Thus 


(18.4.4) [L/ul? (x) SC, (lulf Ox) +... +1ulF Os 


if u(x)=1. For general u we obtain a bound by homogenizing this estimate. 
When g is a Riemannian metric we shall write |u|%(x) for |ul@(x) when G 
=g,. Now we define corresponding symbol :classes as follows: 


lv], --- lel 


ict 


Definition 18.4.2. If g is slowly varying then a positive real-valued function m 
in V is said to be g continuous if there are positive constants c and C such 
that 


(18.4.5) 2,(y)<c > m(x)/C <m(x+y)<Cm(x). 


We define S(m,g) to be the set of all ueC*(V) such that, for every integer 
k20 


(18.4.6) sup |u| (x)/m(x) < 00. 


It is obvious that S(m,g) is a Fréchet space with the topology defined by the 
quantities in (18.4.6). It is important to note that the seminorms are indexed 
by the non-negative integers k so that it makes sense to talk about “the 
same seminorms” in the spaces S(m,g) with different m and g. 


If g is the metric (18.4.1) we can take m=(1+{é|*)“/? for any real 
number y. Then S(m,g) becomes the symbol space S% , introduced in Sec- 
tion 7.8 already. 

The following lemma is an immediate consequence of (18.4.3) and 
(18.4.4): 


Lemma 18.4.3. If ueS(m,g) and veS(m',g) then uveS(mm’,g). If 1/\ul<C/m 
for some C, then 1/ueS(1/m, g). 
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It is clear that C?(V)<S(m, g), for g, and m(x) are bounded from above 
and from below when x is in a compact set. Assuming always that (18.4.2) 
is fulfilled we can apply Lemma 1.4.9 and Theorem 1.4.10 to |y|, =(2,(y)/o)? 
and obtain 


Lemma 18.4.4. If 0<e<1 one can find a sequence x,,X,,...€V such that the 


balls 
B,={x3g,,(x,-x)<R7} 


cover V if ec<R? and the intersection of more than N, balls B, is always 
empty if R?<c, If 2¢¢<R*<c one can choose non-negative ¢,¢C%(B,) with 
> ¢,=1 so that for all v and k 


(18.4.7) IOS Cre: 


The partition of unity can be used to regularize the metric and the 
weight function m. In fact, if we set 


m,(x)=)' p(x) m(x,) 
and observe that g, (x —x,)<c when $,(x)+0, it follows from (18.4.5) that 
m(x,)//C Sm(x)SCm(x,) in supp ¢, 
which implies that 
m(x)/C Sm, (x) Cm(x). 
In addition we obtain 
| [e(x) SC, m(x), 
which means that m,eS(m,g)=S(m,,g). In particular, we conclude that 
S(m, g)< S(m’, g) if and only if m/m’ is bounded. In the same way we can of 
course regularize the metric g. 


If weS(m, g) it follows from (18.4.7), (18.4.3) and (18.4.5) that u,=@,u has 
the bounds 


(18.4.7) lu lE(x)SC,m(x,) for xeV, all v and k. 


Here g may be replaced by g,. Conversely, if we have any sequence of 
functions u, with supports in the balls B, with R*<c and satisfying (18.4.7), 
then u=)'u,eS(m,g). It is therefore clear that optimal estimates for linear 
functionals on symbol classes must be obtained by studying just functions 
with the properties of u,. We shall do so for the Gauss transforms. 


Let A be a real-valued quadratic form in the dual space V’ of V. Then 
A(D) is a differential operator in V characterized by 


A(D) exp <ix,¢> =A(d)expdix,¢>, xe, 
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for every fixed eV’. When u is in Y or in Y’ we can define exp(iA(D))u as 


the inverse Fourier transform of exp(iA(é))a(€) where a is the Fourier 
transform of u. Let g be a positive definite quadratic form in V, and let 


K={x; g(x)< 1} 


be the corresponding unit ball. By (7.6.7) applied to a g orthogonal coor- 
dinate system we have for ueCo(K) 


(18.4.8) lexp(iA(D)) u— ¥. (GA(D) u/jI 
j<k 


< Csup sup|A(D)* ul%(y)/k!. 
js yek 


Here s is an integer >dimV/2. Outside K the sum vanishes, and we shall 
then improve (18.4.8) by an argument equivalent to that used to prove 
Lemma 7.6.4. However, we phrase it differently to prepare for another ana- 
logous proof in Section 18.6. 

Let L be a real affine linear (i.e. not necessarily homogeneous) function 
in V. Then 


[exp(iA(D)), L]=exp(iA(D)) <A‘(D), E> 


where L is regarded as a multiplication operator and LeV’. This is a case of 
(18.1.6) but of course perfectly elementary, for after Fourier transformation 
the identity becomes 


Lexp(iA(é)), L(— D)]=exp(iA(¢)) <A'(), E>. 


Set L(y)=<y—x,4> and assume that L+0 in K. We have <A’(D),y> 
=2A(D,n)=2¢An, D> where A( , ) is the symmetric bilinear form defined 
by A in V’ and A is the corresponding linear transformation V'> V. Thus 
the preceding formula gives 


(18.4.9) exp(iA(D)) u(x) 
=2exp(iA(D))(<An, D> L-*u)(x), ue CE(K), 


and this result may be iterated any number of times. 


Lemma 18.4.5. If L is linear and never 0 in {y; g(y)<R*} where R>1, then 
(18.4.10) |LOOVLE(y) Skt RA(R—-1**', ~yeK. 


Proof. It is no restriction to assume that g(y)=)) yj and that L{y)=1—-ay, 
where 0SaR<X1. Then (18.4.10) follows since a*(1—ay,)~*-1<R/A(R—1)**! 
when ly,{<1. 


If we iterate (18.4.9) and use (18.4.10), (18.4.3), and (18.4.8), with k=0, it 
follows after k iterations that 


lexp(iA(D)) u(x)| SC, 2(g(A m)?/ILO)I)* SOD Sup luff, ueCo(K). 
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To examine how small the new factor g(A 7)*/|L(0)|=2(An)?/|<x, >| can be 
made we introduce the dual form of E->g(A é) 


(18.4.11) gA(x)= sup <x, é>?. 
g(Ae<1 


g* is of course +00 except in the orthogonal space of the radical {£; AE =0} 
={é; A(é)=0}, that is, the range of A. If we set x=An and note that 
<An, €> =CAE,n> we obtain 


gt(A n)=sup |<ysn/"/a(y) 


so g4 is the composition of A~' and the dual form of the restriction of g to 
the range of A. Note that sup, <x, £>?/g4(x)=g(A ). 

Now assume that the g* distance from x to RK is 2a>0. By the Hahn- 
Banach theorem we can then find 7 so that 


<y,n><<x+z2,n> if g(y)<R? and g4(z)<a’. 
This means that 
<ys n> <<x,n>—ag(An)*, yeRK. 
For L(y)=<y —x, > it follows that 
L(0)= —<x, n> < —ag(An)* 


We have therefore proved that for k=0,1,... 


(18.4.12) jexp(i A(D)) u(x)| 


SC, 2(1+ inf g4(x—y))~*? sup sup ul, 
yeRK Jsst+k 
if weC 9 (K). We sum up our results so far: 


Proposition 18.4.6. Let g be a positive definite quadratic form in V and A a 
real quadratic form in V’. Denote by K the unit ball with respect to g, and 
define g“ by (184.11). Then the estimates (18.4.8), (18.4.12) are valid for all 
k20 and R>1 if 2s>dimV. 


Since exp{iA(D)) commutes with differentiation it would have been easy 
to add estimates for the derivatives. The important point in (18.4.12) is that 
the right-hand side is very small at large g4 distance from a neighborhood 
of K. This localization property allows us to get estimates for e'4) in 
appropriate symbol classes; in a special case it was already used in the 
proof of Theorem 18.1.7. The following is the required condition. 


Definition 18.4.7. The Riemannian metric g (and the positive function m) in 
V is said to be A temperate (resp. A,g temperate) with respect to xeV if g is 
slowly varying (and m is g continuous) and there exist constants C and N 
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such that for all y,teV 


(18.4.13) g,(t)SCg.(O(1+aA(x—y))", 

(18.4.14) m(y)< Cm(x)(1+-g4(x —y)). 
Note that (18.4.13) implies 

(18.4.13) set) S Cath +gt(x—y). 


Conversely, (18.4.13)' implies (18.4.13) if A is non-degenerate. When t=x—y 
we obtain in particular 


(18.4.13)" L+gi(x—y)SC(L+93(x—y)"*. 


To remove the condition on the support of u in Proposition 18.4.6 we 
shall use the partition of unity in Lemma 18.4.4. Choose Ry with R<R,<c? 
and introduce in addition to the balls B, containing supp@, the neigh- 
borhoods 

U,={x38.,(%—-x,)SRo}, US ={x58,,(¢—x,) Sc}. 


When ueC? we shall apply (18.4.12) to u,=¢,u with R and g replaced by 
R,/R and g,,/R’. To add up the estimates for exp(iA(D))u, we need the 
following lemma: 


Lemma 18.4.8. Assume that g,<g4 and that g is A temperate with respect to 
x. Then there are constants C and N depending only on those in (18.4.13) such 
that 


(18.4.15) Vat) "SC,  d,(x)= inf g4(x—y). 
v yeUy 


Proof. We may assume that g, is the square of the Euclidean norm | |, 
which is then a lower bound for g4. Let k=1 and set 


M,={v; d,(x) Sk}. 
When veM, we choose y,éU, with g4 (x—y,)<k. By (18.4.13) 
8, (t)S Ctl? k®. 


Now y,+zeU; if g, (z)<(c?—R,)’, and since g is slowly varying we have 
fixed upper and lower bounds for g,,/g,,. It follows that a Euclidean ball V, - 
of radius c,k~%/? and center at y, is contained in U’. In view of (18.4.13)" 
we have 

Ix—y,? =g,(x—y,)Sat(x—y,)S CHT; 


hence the balls V, are contained in a Euclidean ball of radius C’k(¥*1¥/? 


with center at x. Since we have a bound for the number of U; and therefore 
for the number of V, which can overlap, we obtain 


cM, k-"? <¥ m(V,)s Cm VS C kt 9?, 
My My 
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if |M,| is the number of elements in M,. Hence |M,|<Ck’~! for some new 
constants C and N. If we add successively the terms in (18.4.15) with 
veM,,M,~M,,...,.Mye~M..-1, the estimate follows. 


With u,=¢,u we obtain using (18.4.12) 
(18.4.16) Jexp(iA(D))u,(|SC,(1+d,(x))~"? sup sup|u, |. 
jSstk Uy 


Since g varies slowly we may replace g, by g here. With y, defined as 
above we have 


m(y)SCmly,)SC'm(x)(1+d,(x))",  yeU,, 


provided that m satisfies (18.4.14). If the hypotheses of Lemma18.4.8 are 
also fulfilied then (18.4.16), (18.4.15), and (18.4.7) yield 


(18.4.17) > |exp(@A(D)) u,(x)| S Cm(x) sup sup |uli/m, 


provided that k is large enough. 

We shall use (18.4.17) to extend the definition of exp(iA(D)) from C? to 
S(m, g). However, an arbitrary continuous linear form on S(m,g) is not 
determined by its restriction to C? for this is not a: dense subset. We shall 
therefore be interested in a stronger continuity condition: 


Definition 18.4.9. A continuous linear form on S(m, g) will be called weakly 
continuous if the restriction to a bounded subset is continuous in the C™ 
topology. 


A weakly continuous form is determined by its restriction to C?, since 
the partial sums of the partition u=)'u, are bounded in S(m, g) if ueS(m, g); 
they converge to u in the C” topology since they are ultimately equal to u 
on any compact set. 

The proof of (18.4.17) actually gave a convergent majorant series for the 
left-hand side of (18.4.17) which is valid for all u in a bounded set in S(m, g). 
It follows that the sum 


exp(i A(D)) u(x) = }) exp(i A(D)) u, (x) 
defines a weakly continuous linear form on S(m, g). Thus we have proved: 


Theorem 18.4.10. The map Cf autoexp(iA(D))u(x)eC has a unique extension 
to a weakly continuous linear form on S(m,g) for every x such that g is A 
temperate, g,<g*, and m is A, g temperate with respect to x. We have 


(18.4.18) lexp(i A(D)) u(x)| Sm(x) lull, 


where the seminorm |u|| in S(m,g) only depends on the constants in (18.4.2, 
(18.4.5), (18.4.13) and (18.4.14). 
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The hypotheses in Theorem 18.4.10 also allow us to estimate the deriva- 
tives of exp(iA(D)) u(x). Let us first assume that ueC?P so that we know that 
they exist. Then we have 


k 
(18.4.18) |CD,t,>...<D,t,> exp(iA(D)) u(x) Sm(x) [[ 2.) lull, 


where ||u/| is a seminorm in S(m,g) which depends only on k in addition to 
the constants in (18.4.2)’, (18.4.5), (18.4.13), (18.4.14). To prove this we set 


k 
v=<D,t,>...<D,t,>u, m'(y)=m(y) | [8,(t). 


m’ is also a A, g temperate function with respect to x, with bounds depend- 
ing only on the bounds for g and m in addition to k. Any seminorm of v in 
S(m’,g) is bounded by a fixed seminorm of u in S(m,g). Hence (18.4.18) 
applied to v gives (18.4.18)’. (It is sometimes useful to observe that it is 
sufficient to have (18.4.6) for the differentials of order 2k.) The following 
statement is an immediate consequence: 


Theorem 18.4.10’. Assume that the hypotheses of Theorem 18.4.10 are fulfilled 
uniformly for all x in a linear subspace Vy of V. Then the map 


S(m, g)aurexp(iA(D)) uly, 


is weakly continuous with values in the space S(m,g)ly, of symbols in Vo 
corresponding to the restrictions of m and of g. 


For the proof we just have to take t,,...,t,€V in (18.4.18) and note that 
on bounded subsets of S(m,g) the C® topology is equal to the topology of 
pointwise convergence. 


The preceding results can all be improved when 
(18.419) h(x)’ =sup g,(t)/g2(t) 
t 


is not only less than or equal to 1 but is small. (If the coordinates are 
chosen so that g, is the Euclidean metric and A(é)=}'b,éj, then h(x) 
=sup|b,|.) First of all we note that h is A, g temperate with the same 
constants as in (18.4.13). In fact, h is obviously g continuous. We can write 
(18.4.13) in the form g,<Mg, which implies g4>g4/M and g,/g4<M? g,/g4, 
hence h(y)SMh(x). Let us now return to the estimate (18.4.16). Recall that 
there is a bound for the number of overlapping balls U} and that 


8.,(x-y)2(c*?-R,)?=c,>0 when x¢U; and yeU,. 
This implies that 
C2 S8,(x—y) Sly)’ g(x -y)SCh(x)?(1 + e4(e—y)™, 
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hence that 

1<Ch(x)*(itd,(x)" — if x¢ UV’. 
This means that (18.4.17) can be improved by any power of h(x) in the 
right-hand side if we only sum over all v with x¢€Uj. The remaining bound- 


edly many terms can be estimated by means of (18.4.8) with A(D) as above 
which proves that the remainder term 


(18.4.20) Ry=exp(iA(D))u— Y (iA(D)¥ uff! 
J<N 
has the bound 
(18.4.21) IRY? (x5 ty, -.-5t,)| Sh(x)" m(x) Tleett)* lull, 
1 


where |u|} is a fixed seminorm in S(m,g). (The proof for k+0 is reduced to 
the case k=0 as in the proof of Theorem 18.4.10’.) Thus we obtain 


Theorem 18.4.11. Assume that the hypotheses of Theorem 18.4.10 are satisfied 
uniformly for all x in a linear subspace V, of V, and define h by (18.4.19). With 
Ry defined by (18.4.20) it follows that 


S(m, g)auroRyeS(mh", Bly, 


is weakly continuous. The seminorm in (18.4.21) depends only on N, k and the 
constants in (18.4.2), (18.4.5), (18.4.13) and (18.4.14). 


Thus it is justified to calculate exp(iA(D))u(x) by the forma! expansion 
where h(x) is small. An example is Theorem 18.1.7. There the metric is 
(18.4.1), and A is the quadratic form (x, &)><x, > in R"@R", so A is the 
map (x, &>(é,x) and g4=(14+|é|?)|dx|?+|dél?. Thus h?=(1+|é|?)-1, and 
(18.1.11) follows from Theorem 18.4.11 with the weight function m(= 
(1+1é|?)"?, if m is replaced by » in Theorem 18.1.7. In fact, (18.4.13) and 
(18.4.14) follow with N=1 and N=y/2 respectively since 


(L+1817 0 +17) Sd + nl +18 — al)? +1?) +16 — al)’. 


More generally, the metric g defined by (18.4.1) is slowly varying if p<1. 
We have 
g*a(L +e)? Idx? +1 +117)" 71a c?, 


hence h? =(1+|é|?)°-*<1 if and only if 6<p. The condition for g to be A 
temperate is 
(1416/7)? +] l?)? +0 +1817) + ||?) ? 
SC +(E—n/? +1917)- 9. 


If |€|<|n|/2 it is valid if and only if 6<N(1—6), that is, dS N/(N+1). When 
\y\/2<|E|<2|y| it is true for large C, and when |¢|>2|n| it follows if N2p. 
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When the condition is fulfilled then (1+{c|*)/? is A,g temperate. Hence 
Theorem 18.1.7 remains valid if O0<éd<p<1 and 6<1, as observed before 
Theorem 18.1.35. 


18.5. The Weyl Calculus 


Let V be an n dimensional vector space over R and V’ its dual. In 
Section 18.1 we associated with any aeY(W), W=V@V’, the operator 


(18.5.1) a(x, D)u(x)=(22)-" ff a(x, de" uly)dydé, ue. 


(Here dy is a Lebesgue (Haar) measure in V and dé is the dual one in V’ 
such that Fourier’s inversion formula holds with the usual constant. Replac- 
ing dy by cdy one must change dé to c~! dé so dydé is invariantly defined.) 
The weak version of (18.5.1) 


<a(x, D)u, v> =(22)~-" fff a(x, el-” © uly) v(x) dy dx dé 
=(2n)-"fffa(x, el” u(x —t) v(x) dx dtd€ 


makes sense for any ae S'(W) and defines a continuous operator from S(V) 
to #’(V). The adjoint of a(x, D) is the operator 


(18.5.2) G(x, D)u(x)=(22)~" ffaly, de"? uly) dy dé, 


interpreted in the weak sense too. If a is a polynomial in & then a(x, D) is 
obtained when € is replaced by D=-—id/éx placed to the right of the 
coefficients. Putting the coefficients to the right instead we obtain the 
operator d(x, D). 

If aeS” then Theorem18.1.6 means that a(x,D) maps ¥ to ¥. By 
Theorem 18.1.7 the class of operator (18.5.1) with aeS” is the same as the 
class of operators (18.5.2) with aeS” so they can be extended to continuous 
operators from /’ to & as well. 

In the Weyl calculus one adopts the symmetric compromise 


(18.5.3) a” (x, D)=(22)-" ff a(x +-y)/2, De” u(y) dy dé, 


again defined in the weak sense. The Schwartz kernel K is 


(18.5.4) K(x, y) =n)" f a(x + yA Qee-”® dé, 
that is, 

(18.5.4) K(x+t/2,x—t/2)=(2n)-"f a(x, del dé 
is the inverse Fourier transform of a with respect to €, so 
(18.5.4)" a(x, £)=J K(x+t/2,x—1/2)e**® dt. 


(These formulas are analogous to (18.1.7) and (18.1.8).) If L is linear then 
L(x, D)= L(x, D)=L"(x, D). To explain the definition (18.5.3) we first compute 
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DL’ (x, D) a” (x, D) for aéY when L is linear. Since 


(L(x, D,) L(x + y)/2, a(x +yy/2, eto" 
=(L(O, D,) a(x + y)/2, 6) +.a((x + y)/2, &) L(D,/2, 0) *-”?, 
we obtain after an integration by parts in é 
(18.5.5) D*(x, D)a"(x, D)=b”(x,D), b=La+{L,a}/2i, 


where 


{L,a} =AL/0E, da/ax> — OL/Ox, da/ae> 


is the Poisson bracket introduced already in Section 6.4. Recall that it is the 
bilinear form in W’@ W’ which is dual to the symplectic form 


a(x, 0; yM=Koy>9-<%, (x, ew, (y, ne W. 


This indicates already the symplectic invariance of the Weyl calculus which 
is an important property to which we shall return later on. At the moment 
we just observe that although the proof of (18.5.5) above assumed that aeY 
the formula extends by continuity to all ae.¥’. Let L be a real linear form 
and set a,=exp(itL). Then we obtain 


i L(x, D) a}’(x, D)=daf’(x, D)/ét 


for {L,a,}=0. A simple explicit computation of af’(x,D)u shows that 
ay’(x, D)ueS if ueX Now the closure in I? of L(x,D) with domain ¥ is 
self-adjoint. In fact, let uel? and L(x, D)u= fel’ in the sense of distribution 
theory. Choose yeSY(W) with xzO)=1. Then £537,(x,D)u-u and 
x,(x, D)f +f in ? as e>0, and 


L(x, D)x,(x, D)u—y,(x, D) f = —ei{L, x} (ex, eD)u—0 
where 7,(x, 2)=y(ex, 6 ¢). This proves the statement. It follows that 
a," (x, D) = exp(it L(x, D)) 


in the operator theoretical sense. The general definition of a” is deduced 
from this case by Fourier decomposition of a, so the preceding property 
characterizes the Wey] calculus. 

From (18.5.3) it follows at once that the adjoint of a” is equal to a”. In 
particular, a” is its own adjoint when a is real valued, which is an essential 
advantage of the Weyl calculus and an important reason why it was in- 
troduced by Hermann Weyl for the purposes of quantum mechanics. 


In order to motivate the conditions which will be placed on the symbol 
a we shall now derive a formula for the composition of a{(x, D) and a¥(x, D) 
when a, and a, are in S(W). The kernel of a(x, D) a} (x, D) is equal to 


(2n)~?" fff a, ((x+2)/2, Cay (z+ y)/2, tek +42-% dz dl dr, 
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by (18.5.4), so it follows from (18.5.4)” that aY a3 =a” if 
a(x, 6) =(22)~?" (ff ay (x + 2 +1/2)/2, 0) a. ((x+2—-1/2)/2, t) e* dzd€ dtdt, 
E=(x—2z4+t/2,0E0+¢z—-x+t/2,0-<t,© 
=<(x—724+1t/2,€C-—€)4+ <z-—x4+t/2,7-€). 


We introduce (—€é, t—&, (z—x+t/2)/2 and (z—x—t/2)/2 as new variables 
instead of (, t, z and t. The Jacobian is 27". Hence 


a(x, =n?" (ff ay(x+z,€4 Qag(xtt, 6+ ees 9 dz dl dtdt. 


Here we regard the symplectic form as a quadratic form on W@W. 
For fe S (IR?) we have 


SSF, ye? dxdy=(4n)~* [ff (Ene~ 2"? dé dn. 


This follows from the Fourier inversion formula if f(x, y)=g(x)h(y). Hence 
the formula above can be written in the form 


(18.5.6) a(x, ¢)=exp(io(D,, D,;D,, D,)/2) a, (x, 6) aay, Mix,22 (0) 


We can therefore use the results of Section 18.4 to study a if a, and a, 
belong to suitable symbol classes. Since we have a product a,(x, €)a,(y,”) in 
the right-hand side we shall encounter quadratic forms in W@W of the 


form 
G(t,,t2)=21(t1)+ g2(t2) 


where g, and g, are quadratic forms in W. If (x, é, y,n)eW@ W and (%, & §,#) 
denote dual variables, then the linear map associated in (18.4.11) with the 


quadratic form . 
A=26(%,69,N=2¢E 9) -2KR 
maps (%, €, §, 4) to (—#, §, €& —%). Hence 
GA(x, €,y,n)=sup <x, 2>+<(E D+, 9+ Cn ?7/E(—A,+82(F —L). 
If we write (x, é)=w and (€, —%)=w’eW then <x, 2) +, =a(w,w’). Let 


(18.5.7) g3(w)=sup |o(w, w’)|7/g,(w’) 


be the dual quadratic form of g,; when W is identified with its dual by 
means of the duality defined by the symplectic form. Then we obtain 


(18.5.8) GA(w,, W2)= 85 (W2)+ 83 (1), 
(18.5.9) GSH’ G4 = g,<H’ gi = g, SH’ 3, 
for the second and third conditions are both equivalent to 
(18.5.9) |o(w, w')|? SH? ef(w) 23 (Ww), 


and the first is equivalent to the conjunction of the second and third. 
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If g, and g, are slowly varying Riemannian metrics in W, and G is the 
Riemannian metric g, ®@g, in W@W, it is obvious that G is slowly varying. 
To discuss (18.5.6) we must examine if G is uniformly A temperate with 
respect to the points on the diagonal, that is, if for arbitrary 
W,W1,W2,0,,f,EW 
(18.5.10) Bf (t1) +83 w(E2)S ClB iw, (C1) + Bowa(t2)) M™, 

M=1 + 85 w,(W2—W) + 25, (W1 —w). 
if g, =g,.=g it follows in particular when w,=w, that for reW 
(18.5.11) BSCR, ((L+g%,(w,—w)*; ww, wi EW; 
or equivalently 
(18.5.11Y gy, )S Cg, (1 +8,0%, Ww)" w, wieW. 


We therefore introduce a definition parallel to Definition 18.4.7. 


Definition 18.5.1. The metric g in W=V@V’ is called o temperate if it is 
slowly varying and (18.5.11)' is valid. A positive function m in W is called o, 
g temperate if m is g continuous and 


(18.5.12) m(w,)SCm(w)(1+2%, (w—w,))*; ww, w EW. 
Note that by (18.5.11) 1/m is o, g temperate if m is. 


Proposition 18.5.2. If g is o temperate and m,,m, are o, g temperate in W= 
V@ V' then the metric G=g, @g, in W@ W, where g, =g,=g, and the weight 
function m=m,®m, are uniformly A temperate and A, G temperate with 
respect to the diagonal. If h(w)* =supg,,/g%, then sup G,, ,,/G4,,, =h(w)* too. 


Proof. The last statement follows from (18.5.9). To prove the others we must 
show that 


gy, (WW) + 8%,,(W2-—w)S CM*, M=1+ Bi, (W2— Ww) + gh, (wy — W). 
Writing w' =w,+w,—w we have w’—w,=w,-—w, w’—w,=w, —w, hence 
gs (w2—w)S Cai, (w,—w)(1+2%,(w,—w) XS CMN*}, 
g°,.(w, -w)S CM*%*}, 
85, (W, Ww) S Cay,(w, —w)(1 +8%,.(w, —w)) SC M™, 
Bi (W2 ra w) Ss Cc M™, 
which proves the statement with the constants C’ and N’. 
Occasionally it is necessary to consider the general case where g, and g, 


are different. However, this is a rare situation so it might be best to skip the 
following result during a first reading. 
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Proposition 18.5.3. Let g, and g, be o temperate in W. Then G=g, @g, is 
uniformly A temperate with respect to the diagonal in W@ W if and only if 
(185.13) gt, (0S Cet, +e3(,—w))"; ww EW; 

Bwt)SCeiy (tL +8{.(W2—w))"s ww EW. 
The metric g=(g,+2,)/2 is then o temperate in W. If we set 

hw)? =Sup 8 jo/BSws A (Ww)? =SUP 814/84 w= SUP B2w/Biws 
then 
(18.5.14) max (h,(w)*, h,(w)*; H(w)?) 
<4sup g,/2%, Sh, (wy +h, (w)? +2 Hwy’, 


If m, is o,g; temperate then m=m,@m, is uniformly A, G temperate with 
respect to the diagonal in W@ W if and only if 


(18.5.15) m,(w,)SCm,(w)i+g3,(w—w,))"; ww eW; 
m,(w,)SCm,(w)(1+gf,(w—w,))*; ww ,eW. 

These conditions are equivalent to m, being o, g temperate. 

Proof. If (18.5.10) is valid we obtain (18.5.13) by taking t,=t, t,=0, w,=w 

or t,=0, t,=t, w,=w. Assume now that (18.5.13) is valid. We shall then 

prove that for all t,w,w,eW 

(18.5.16) Siw, (OS Cg, (0) +85,,(~—w,))". 

This implies that g is o temperate, so it follows from the proof of Proposi- 

tion 18.5.2 that for some other C and N 

Siw, (t,)+ Sw (£2) SC(g,,(t)+22,(t.)) + Bi, (Wa —w)+ Br (Wy —w))% 


which is a stronger estimate than (18.5.10) since g’?<2g§. Thus G is uniform- 
ly A temperate with respect to the diagonal. 

To prove (18.5.16) we first observe that if F, and F, are positive definite 
quadratic forms on a vector space V, then the dual form of F,+F, on V’ is 
inf,.y (Fi (. -— + F,(0), if Fj is the dual form of F, on V’. If we diagonalize F, 
and F, simultaneously we find that it suffices to prove this for the forms F, (x) 
=x7?/a and F,(x)=ax? on R. Then F/=F, and F,=F, and the verification is 
elementary. Thus we have 


Bwo(t)= inf 2(87 W(t — fo) + B3w(to))- 


The estimate (18.5.16) is therefore equivalent to 
(185.16! g;,,Q0SCg,()M", M=1+8%,,,(w—Wo) + 85w,(Wo— Wi) 
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for all t,w,wo,w,eW. Since g, is o temperate and (18.5.13) is assumed, we 
obtain for some C and N 
Siw. CZ iwoll + 25, (Wo —w,))"s CZ iwo M*, 
Cio Cg;,(1 + 25 y(W —w,)). 
Since again by (18.5.13) and the fact that g, is o temperate 
Siw (W —Wo)S C Boy, (w —wo)(1 + 25, (%o —w,))"s CMTS 


we obtain (18.5.16)’. The same proof shows that m, is o, g temperate. 
Conversely, if m, is o, g temperate then (18.5.15) follows since }g%,<g4,; if 
m, is g, continuous it also follows that m, is o,g, temperate. 

Since g,,,S2g,,, hence gf, <2g%,,, we have for j,k=1,2 


AB | eB jal ves 

which proves the left-hand inequality in (18.5.14). To prove the right-hand 
one we use that 

285 =Biwt Baw S(hj +H’) 85,,, 
which implies 

2giwS(hi+H* yg, f=1,2, 
and therefore 

4g, S(hi+hz+2H’)g, 


which is the right-hand inequality in (18.5.14). The proof is complete. 


Combining Theorem 18.4.11 and Proposition 18.5.2 we now obtain the 
main theorem of the Weyl calculus: 


Theorem 18.5.4. Let g be a o temperate Riemannian metric in W=V@V' with 
g<g’, and let m,,m, be o, g temperate weight functions in W. Then the 
composition formula (18.5.6) can be extended to a weakly continuous bilinear 
map (a,,a,)-a=a, #a, from S(m,,g) x S(m,,g) to S(m, m2, g). If 


(18.5.17) h(x, €)? =sup g,. -/g%, 2 
then the map from a,éS(m,,g) and a,€S(m,,g) to the remainder term 
a, #a,(x,)— ¥ (io(D,, Dz; D,, D,)/2) a(x Dasly, mii! 
j<N 
evaluated for (x,€)=(y,n), is continuous with values in S(hN m,m,,g) for every 
integer N. It is zero if a, or a, is a polynomial of degree less than N. 


The terms with j even (odd) are (skew) symmetric in a,, a,. This implies 
that 
a, #a,—a,+#4, —{a,,a,}/ieS(h* m, m,,g), 


a, #4,+4, 4a, —2a,a,ES(h* m, my, g). 
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In both cases we would have a factor h less in the calculus developed in 
Section 18.1. 

Using Proposition 18.5.3 instead of Proposition 18.5.2 we obtain a more 
general result: 


Theorem 18.5.5. Let g, and g, be o temperate metrics in W=V@V' satisfying 
(18.5.13), and assume that the function 


(18.5.17) H(x, é) as sup Six, e/89x,e= Sup 82x, /Bix,e 


is <1. Set g=(g,+28,)/2, and let m, be g, continuous o, g temperate weight 
functions. Then the composition formula (18.5.6) can be extended to a weakly 
continuous bilinear map (a,,a,)ra=a,+#a, from S(m,,g,)xS(m,,g,) to 
S(m,m,,g). The map to the N™ remainder term is continuous with values in 
S(H™ m, m,,g) for every integer N. 


Note that the error terms in the calculus improve by powers of H which 
may be much smaller than the function h defined by (18.5.17). 
To give some examples we first observe that if B(x,é)=2<x,é> then 


ge (yn) =, -(y, —1) since 
Cy, é» oe «x, n> =a((y, _ n), ame (x, €)). 


If the V and V’ directions are g orthogonal at every point it follows that g? 
=’. From the discussion at the end of Section 18.4 it follows therefore that 
the metric (18.4.1) is o temperate if O<d<p<1 and 6<1. More general o 
temperate metrics enter the theory in the following way: 


Proposition 18.5.6. Assume that g is o temperate, that G=mg, where mz1, is 
slowly varying, and that G<G°. Then it follows that G is o temperate. 
Proof. We must show that for some C and N 


(18.5.18) G,,, S$ CG,,(1 + G2, (w—w,))%. 


If G,(w—w,) is sufficiently small this follows from the hypothesis that G is 
slowly varying, so we may assume that G,(w—w,)2c,. Next assume that 
£,(w—Ww,)<c with c so small that this on 


Ose 
Then (18.5.18) follows with N =1 if we show that 
m(w,)SC’ m(w) gf, (w—w,)/m(w)). 
Now 
Cy Sm(w) g,,(w—w,)S Cm(w)g,,, (w—wy)S Cm(w) h(w,)? 2%, (w—wy) 


if h is defined by (18.5.17), and h(w,)? m(w,)? <1 since G<G*’. This proves 
the assertion, and it only remains to study the case where g,, (w—w,)2c, 
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for some fixed c,>0. Then we have 
gy, (W—w,)/m(w,)= GY, (w—w,)2G,, (w—w 2c, m(w,), 
which implies 
c,m(w,)*<g% (w—w,), Cy 8%, (w—wy)SG%, (w—w,). 
Since ; 
Bw, SC By(l+e%,(w—w,))” 
and m= 1, we obtain (18.5.18) with N=2.N‘+1. The proof is complete. 


For conformal metrics the condition in Theorem 18.5.5 also simplifies: 


Proposition 18.5.7. Assume that g, and g, are conformal o temperate metrics 
with h,(w)? =sup 8 jw/Sjw St. Then (18.5.13) is valid and the function H in 
(18.5.17) is (h, h,)*. 


Proof. By hypothesis g,=mg, for some m. Thus h} =m7h? and 
H(w)? =sup g>,,/8$,=m(w) h, (w)? =h, (w) ha (). 


The first estimate (18.5.13) follows from the slow variation of g, when 
Siw(W, —w) Se. If g,,(w, —w)2c then 


85(W— Wy)? =m(w)~? gi (w— Ww)? 
=m(w)~? hy(w)~? gy,,(W—W,) 87(W—W,) 
2cgi,(w—wy,) 


since m(w)h,(w)=h,(w)S1. Since g, is o temperate the first estimate 
(18.5.13) follows, and the other is proved in the same way. 


Theorems 18.5.4 and 18.5.5 will give a farreaching generalization of 
Theorem 18.1.8 once we have an analogue of Theorem 18.1.6 allowing us to 
regard a, +a, as the symbol of a composition of operators in Y or in #. 
Since this is not quite straight-forward we postpone the proof to Sec- 
tion 18.6. Instead we shall discuss here the invariance of the Weyl calculus 
under affine symplectic transformations y, that is, affine maps y in W with 
y* o=0. In doing so we may assume that V=IR". Examples of such maps 
are 

(a) The translation x->x+x, in V. 

(b) The translation ++, in V’. 

(c) The map x(x, €) replacing x;,é; by ¢;,—x;, leaving the other coor- 
dinates unchanged. 

(d) The map x(x, €)=(Tx,'T~* & where T is a linear bijection in R”. 

(e) The map x(x, )=(x,€—Ax) where A is a symmetric matrix. 


Lemma 18.5.8. Every affine symplectic map is a composition of maps of the 
preceding types (a)-(e). 
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Proof. Since (a) and (b) supply all translations it is sufficient to consider 
linear maps y. The group G generated by the maps of type (c)-(e) is 
transitive. In fact, the basis vector e, =(1,0,...,0) is mapped to (1,0, ...,0, €) 
for any desired € by the map (e) for an appropriate choice of A, hence to 
any (x, é) with x+0 if we follow with a map of type (d). All elements of the 
form (0,é), +0, are obtained if we use the maps (d) and (c). Any x is 
therefore the product of an element in G and a symplectic linear map x, 
with y, e, =e,. Thus 


o(x1(x, €),e,)=a((x, O),e)=C, 


sO x, preserves the &, coordinate. If x’=(x,,...,x,) and é’=(€,,...,é,) it 
follows that the map y,(x’,é’) defined by 7,(0,x’,0,¢') when the x, and €, 
coordinates are dropped is also symplectic. If n>1 and the statement is 
already proved for smaller values of n we can now write 7, =737>2 where 7, 
is in the group G corresponding to the x’ ¢’ variables and 


3 (X, C)=(X Hay Ey, 0 Xp Hn EE 1562492 E,,..-56, +5, €). 


That 7, is symplectic means that 


Vide, najdé,+b,dé, adx,)=0 

2 
SO a,=...=a,=b,=...=b,=0. Thus x, is the map (e) with Ax 
=(a, x,,0,...,0) conjugated by (c) with j=1. This completes the proof. 


Theorem 18.5.9. For every affine symplectic transformation y in W=V@V' 
there is a unitary transformation U in I7(V), uniquely determined apart from a 
constant factor of modulus 1, such that for all linear forms L in W 


(18.5.19) U~! L(x, D) U =(Lex)(x, D). 
U is also an automorphism of ¥ and of F', and 
(18.5.19)' U-' a(x, D) U =(acy)”(x, D) 
for every aeS'(W). 


Proof. It is sufficient to prove uniqueness when y is the identity. So let U be 
a unitary transformation with 


UL(x, D)=L(x, D)U 


for all linear forms L. Then U commutes with the one parameter group 
generated by L. In particular, U(fg)=f(Ug) if gel?(V) and f is a bounded 
exponential, hence if f is any function in “ for it can be decomposed into 
exponentials by the Fourier inversion formula. It follows that Ug=hg for 
some h of modulus 1 almost everywhere. Since U commutes with trans- 
lations also, it is clear that h must be a constant. (Note that the result is 
extremely close to Lemma 7.1.4.) 
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To prove the existence of some U satisfying (18.5.19) it suffices to 
consider the cases (a)-(e) above. Then we can take 

(a) Uf(x)=f(x—Xo); (b) Uf(x)=f(x)exp<ix,co>. (c) U= Fourier 
transformation with respect to x,; (d) Uf(x)=f (T~! x)|det T|~*; (e) Uf (x) 
=exp(—i¢(Ax,x>/2)f(x). This completes the proof of (18.5.19). If a(x, 
=exp(i L(x, €)) for a real linear form L then a(x, D)=exp(iL(x, D)) in the 
operator theoretical sense so (18.5.19) is valid then. Since bounded exponen- 
tials are weakly dense in # it follows that (18.5.19) is always true, which 
completes the proof. 


We shall now study the relation between the Weyl calculus and the 
calculus established in Section 18.1. First assume that ae S(W). Then a(x, D) 
has a kernel KEY given by (18.1.7), so we can write a(x, D)=b”(x, D) where 
be is given by (18.5.4)”, 


b(x, €)=f K(x +t/2,x—t/2)e 7% dt 

=(2n)-"[fa(xtt/2,nel-? dy dt 

=n" {fa(xtte+nye%%” dtdn 

= e(Pn De>I2 a(x, 8, 
Here the last equality follows from the argument which led to (18.5.6). If 
€(x, D) is defined by (18.5.2) then it follows from (18.1.9) that a(x, D)=é(x, D) 
if 

a(x, €) =e! P=P® e(x, €). 

For reasons of continuity these observations are also true if a,b,ceS’. Now 
we pointed out after Theorem18.5.5 that a o temperate metric g with 
8x, e(t, 7) =, (t, —7) is also B temperate if B(x,€)=2<x,¢>, and m is then o, 
g temperate if and only if m is B, g temperate. Hence we obtain: 


Theorem 18.5.10. Let g be o temperate, g<g°, and let m be o, g temperate. If 
&x,2(t, 7) =g,, (t, —t) then exp<ixD,,D.> is a weakly continuous isomorphism 
of S(m,g) for every KER, 
elk OnP® a(x, )— Y) Kix D,, Dz>i a(x, O)/j !eS(h’ m, g) 
J<N 

for every integer N if h is defined by (18.5.17). If a,b,ceS(m, g) then a(x, D) 
= b”(x, D)=é(x, D) if and only if 
(18.5.20) d(x. cee! a(x cy Sere Per ooag), 

a(x, €) = el P~ P#!? b(x, Z) = eh P=?” e(x, £), 

gc) ee te aad) See Pe T(x, 2), 
If g,. -(0, t)<|t|? then the bilinear maps (a, u) a(x, D)u, (b, u)t> bY (x, D) u and 
(c, u)rH E(x, D)u are continuous from S(m,g)x ¥ to F and from S(m,g)x S’ 
to #’. 
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Proof. It suffices to show that (a,u)++a(x,D)u is continuous from S(m, g) 
x F¥ to & Since 


m(x, €)S Cm(0,0)(1 +86, of, YS CCL + 1x1? +18)" 


for some C, C’ and N, the map is continuous with values in the space of 
continuous functions f with sup| f(x)I/(1 +|x|?)¥ < oo. By (18.1.6) we have for 
some constants C,, 
(1+|x?"a(x,D)= D0 cyga(x, D) x?, 
la+BlSN 

and the assumption on the metric implies that a->a™ is a continuous map 
in S(m,g) for every «. Hence a(x,D) is continuous with values in the 
continuous bounded functions in IR". The continuity with values in # 
follows from the proof of Theorem 18.1.6 since for any teW 


<D,t> aeS(m’, g) 


where m'(x, €)=m(x, ¢)g,. 2(t)* is also o, g temperate. (See also the proof of 
Theorem 18.4.10’.) This completes the proof. 


Remark. Theorem 18.6.2 below will show that the restriction imposed on the 
metric in the continuity statement is superfluous. However, the proof is 
much less elementary then, and the easy result proved above covers the 
metrics which are most frequent in the applications. 

If ajeS(m;,g) and ue we can conclude under the assumptions made in 
the theorem that 


a} (x, D) a} (x, D)u=(a, #a,)"(x, D)u. 


In fact, we can take sequences a,,eS which are bounded in S(m,,g) and 
equal to a; on any compact set for large v. Then a},(x, D)u— a(x, D)u in S 
as v- 00, SO 

(a,,4#4,)"(x, D)u=a¥,(x, D) a(x, Du a(x, D)a(x,D)u in 


Since a,,#a,, converges weakly to a, 4a, the assertion follows. 

Still under the hypotheses in Theorem 18.5.10 one can easily recover 
composition rules for standard pseudo-differential operators from the pre- 
ceding results. Suppose for example that a;e¢S(m,,g) and set a,(x,D)a,(x, D) 
=a,(x, D). Then a,(x, D)=b?’(x, D) for 

b,(x, C= e710?! a(x, €), 
and 
b, (x, é) == ele, Dy) — Dx, Dn dM2 bi (x, é) boy, nlx, Eé=y,n° 
Since 
((D,+D,,D,+D,>+<Dz D,»> =<), D> 
~{D,,D,>—<D,,D,>/2=<Dz, D,> 
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it follows that 
(18.5.21) a3(x, =e PHY a. (x, ars Mi eoyn 


which is precisely (18.1.14) which is therefore proved now in much greater 
generality than in Section 18.1. 


If a(x, D) is a pseudo-differential operator with polyhomogeneous symbol 


a(x, €)~a,,(x, E)+a,,_ (x, E)+ ase9 


where a, is homogeneous in € of degree j, we can use Theorem 18.5.10 to 
write a(x, D)=b”(x,D) where b(x,é)~}'b,,_ 0%, ©). Then 

b,,(X, €)=a,,(x, é), Deve 1(X, C=ay_ 1X, é)+ i . a? Dn {Xs 6)/0x ,0¢,/2, 
and b,,_, is the subprincipal symbol introduced in (18.1.32). Thus the Weyl 
calculus explains the role of the subprincipal symbol, and the composition 
formula in the Weyl calculus gives immediately rules of computation for the 
subprincipal symbol. 
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In this section we shall first prove that a”(x,D) is continuous in ¥ and in 
SF’ when aeS(m,g) if g is o temperate and m is o, g temperate. In a special 
but important case this was proved in Theorem 18.5.10. When the hy- 
potheses of Theorem 18.5.4 or Theorem 18.5.5 are fulfilled the result gives as 
in the special case that (a, 4#a.,)”(x,D) is the composition of a(x, D) and 
a3(x,D) in ¥ as well as in Y’. When g<g’ and m is bounded (resp. m—0 at 
00) we shall prove that a”(x, D) is bounded (resp. compact) in L?. This result 
combined with the calculus in Section 18.5 will give efficient lower bounds 
for operators with non-negative symbol. 

As usual we shall split the symbol by means of the partition of unity in 
Lemma 18.4.4. To handle an individual term it suffices to have the following 
elementary result: 


Lemma 18.6.1. For every ac Y(IR2") we have with operator norm in I? 


la" (x, D)| S$(2m)-*" alps =llallen- 


Proof. The Schwartz kernel K of a”(x, D) is 
K(x, y)=(2m)“"Ja((x+y)/2, eb” d& 
=(2)~?" | (0, y—x) Ft a8, 
Hence 
IK yldxSllalens, JIK@, yl dySllallen, 


so the lemma follows from Lemma 18.1.12. 
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Remark. When using the lemma we shall need the obvious fact that |ja||,;: 
can be estimated by a finite number of seminorms in & It is also essential 
that this norm is invariant under composition with affine transformations, 
which is obvious since it is the total mass of characters in the Fourier 
decomposition of a. 


Theorem 18.6.2. If g is o temperate and m is o, g temperate, then a”(x, D) is a 
continuous map from S to F and from F' to F' for every aeS(m, g), and it is 
weakly continuous as a function of a. 


Proof. It suffices to prove the continuity in ¥ since the continuity of a” in 
fF implies that of a” in Y’. Let {@,} be the partition of unity introduced in 
Lemma 18.4.4, define the neighborhoods U, of supp@, as before Lem- 
ma 18.4.8, and set 

a=)\a,, a,=, 4. 


If w, is the center of U,, the seminorms of a, in S(m(w,),g,,,) have bounds 
independent of v. By Lemma 18.6.1 and the remark following it we conclude 
that 

lay’(x, D)|| S Cm(w,), 
for all positive definite quadratic forms are equivalent under arbitrary linear 
transformations. Thus we have with I? norms 


lay (x, D)ul| S Cm(,) [lu]. 


To obtain a better estimate we shall use an argument parallel to the proof 
of Proposition 18.4.6. 

Let L be an arbitrary linear form on W which is positive in U,. By 
Lemma 18.4.5 we have a uniform bound for L(w,)/L in S(1,g,,) over the 
support of ¢,. Now Theorem 18.5.4 or just (18.5.5) gives 
(18.6.1) a(x, D)u=(a,/L)”(x, D) L(x, D)u+if{a,/L,L}"(x, D)u/2. 

Here the symbol of a,/L is uniformly bounded in S(m(w,)/L(w,), g,,,). If we 
write L(w)=o(w,t), teIR?", then 

{a,/L, L} =—-L Xt, da,>. 
Thus we have uniform bounds for the seminorms of {a,/L, L} in 


S(m(w,) 8y,(07/L(w,), By,)- 
L(x, D)u is a linear combination of x,;u and D,u with coefficients bounded 


by the length of t in a fixed metric such as gy. Iterated use of (18.6.1) gives 
(18.6.2) — la¥(x,D) ul] SCym(w,)R>*% Y |ix*DFull, ueF, 


ja+B|SN 
for every positive integer N, provided that 0<R,<L(w,), L+0 in U,, go(t) 
+2,,()S1, L(w)=o(w,1). By the Hahn-Banach theorem we can take for R, 
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the distance from 0 to U, in the norm dual to (fo+8y,)* with respect to o. 
This norm is defined by 


wl], =inf(g2, (w,)+2(w,))7, W,+W,=W, 
(See the proof of Proposition 18.5.3.). Thus we can take 


R,=min [wl 
weU, 


In a moment we shall prove that for some N 
(18.6.3) Y(L+R,)-%<oo, hence R,> 0, 
(18.6.4) Zo(wSCA+R,)”. 


Admitting these estimates we conclude using (18.6.2) for large v that 
|a’(x, D)ul| can be estimated by a seminorm of u in & if 


m(w,)< C(1+go(w,))” 


for some N, which is always true when m is o, g temperate. Let M(x, D) be 
linear in (x, D). Then M(x, D) a(x, D) has the symbol 


Ma,+{M,a,}/2i 


which is bounded in S(m;,g,,.) for some other m', bounded by a power of 1 
+g (w,). Hence the ? norm of M(x,D)a”(x,D)u can be estimated by a 
seminorm of u in “& Repeating the argument we conclude that a(x, D) is 
continuous in & 

It remains to verify the estimates (18.6.3) and (18.6.4). The proof is 
parallel to that of Lemma 18.4.8. Let 


M, = {v; R2 Sk}. 
When veM, we can choose w,¢U, and w* so that 
SHAW, Wyk, go(wy) Sk. 


In the first inequality we pass to the equivalent norm g%, and conclude 
using (18.5.11) that 


Bon(We—- WSC, g8(wi wi) SCR, 
Hence 
wy Bw SCs By kM ES Cy By kM2. 
80 (w, ba i) Ss C, SulWy w,) ke Ss Cy ks, 
Bo(W)/2 Sgo(w, —wy) + go(wy) SCs k™?. 


The estimate (18.6.4) is an immediate consequence, and (18.6.3) follows if we 
repeat the proof of Lemma 18.4.8. This is left for the reader. 


Br, CR, SC, kM SC, Ok, 


Next we shall discuss continuity in L’. 
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Theorem 18.6.3. Assume that gg", that g is o temperate, and that m is o, g 
temperate. Then the operator a”(x,D) is L? continuous for every aeS(m,g) if 
and only if m is bounded. The L? norm of a“(x,D) is then a continuous semi- 
norm in S(m, g). 

In the proof of necessity we shall need the following lemma: 


Lemma 18.6.4. If g is a positive definite quadratic form in R*" then there is a 
linear symplectic map y in IR?" such that 


a(x(x, E)= > Aj(cj + G5). 
Here A, are uniquely determined by g, and 


sup g/g’ = max J}. 


Proof. Since sup g/g* is symplectically invariant it suffices to prove the last 
statement when g(x, £)=)"A,(xj +7). Then we have g*(x,é)=) (x? +?)/A, 
so the statement is obvious. To prove the lemma we shall consider the 
eigenvalues and eigenvectors of the map 


F: (x, C)r+ H, (x, €)/2 =(08/0€, — 0g/0x)/2 


which is defined in a symplectically invariant way. Note that if g(x, é) 
=) A(x} +) then 

F(x, E)=(A, by -es An Ege Ay Xs cers —Ag Xp)s 
so x,=1, ¢;= +i, all other coordinates 0, is an eigenvector with eigenvalue 
+id,;. Hence 4,,...,4, are uniquely defined by g apart from the order. For 
general g, if z is an eigenvalue with eigenvector (y,q)eC2", then 

dg/On=2zy, dg/dy=—2zN. 

Writing (0, w) =) 0, W, if 6, we€", we obtain 


(dg/dy, y) + (6g/An, n)=22((y,n)—(, y)- 

In the left-hand side we have a positive definite Hermitian form, and 

(y,n)—(n, y)=2i(CIm y, Ren —<Re y, Im>)= —2ia(e,,e;) 
if e, and ¢, are the real and imaginary parts of (y,7). Hence 4=2/i is real 
and not 0. Taking the complex conjugates of y,n,z if necessary we may 
assume that 4>0. Then we have o(e,,¢,)>0, or o(e,,¢,)=1 if we multiply 
by an appropriate constant. Using the symplectic maps (c)-(e) in Lem- 
mai8.5.8 it is easy to find a linear symplectic map y mapping the unit 
vectors along the x, and €, axes to e, and e, respectively. (See also 
Proposition 21.1.3 for a complete proof.) Replacing g by goy we have then 


g(x, €)=A(xt +¢7) +81(% ¢) 
where g, is independent of x, and ¢,, for dg/0x,=0g/0¢,=0, j+1, at the 


eigenvectors (1,0,...,0, +1,0,...,0) of F. This proves the lemma by induction 
with respect to n. 
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Remark. In Theorem 21.5.3 we shall give a much more thorough discussion 
of the symplectic equivalence of quadratic forms using the tools developed 
in Section 21.1. Note that in Lemma [8.6.4 we can equally well reduce g to the 
quadratic form ) (xj +47 ¢}). Lemma 18.6.4 is well known in classical me- 
chanics where 4, occur as the fundamental frequencies of the small oscil- 
lations of a mechanical system around an equilibrium point. 

In the proof of sufficiency in Theorem 18.6.3 we shall use the same 
decomposition of a as in the proof of Theorem 18.6.2. To contro! the sum 
we need the following lemma of Cotlar, Knapp, Stein, Calderén, Vaillancourt, 
Bony and Lerner on sums of almost orthogonal operators, with an additional 
remark on compactness which prepares for Theorem 18.6.6. 


Lemma 18.6.5. Let A;, i € I, be a countable number of bounded oper- 
ators from one Hilbert space Hy to another Ho such that for j € I 


(18.6.5) Sper AARP <M, ger AAG <a 
Then >); 4 (Anu, Aju)| < M?|lull?, u © Hi, the sum 


Su= > Aju, ue Mh, 
gel 
exists with norm conergence in Ho, and ||S|| <M. If the sum 
S3=)0 4; 
ged 
is compact for every J C I, then ||A;|| + 0 when j > 00 in I. 
Proof. Set T = S)cyjpAFA, where a € C, |ajyx| S 1 and only a finite number 
N are not zero. Then T* = >) Oy Af Ay. We have [7 ||? = ||T*7 ||, and more 
generally ||7'||?" = |\(7*T)"|| by the spectral theorem, if m is a positive integer. 
For the terms in the expansion 
(T*T)" = DO Aj, Aj jaja Aj, Aj ++ jen jan A 


am 1 Aiinn 
we have the estimate 
Aj, Aip -- - Ajgn —» Alam I 
S min(|Aj, All --- Aj, Aian I> 
Aj, I Aja Agel ~~ [Aja —2 Afr Hl IIAfonUl)- 
Taking the geometric mean of the two upper bounds, and noting that ||A;|| <M 
by hypothesis, we obtain 


IT" || SM Slajpl At Ap ll? Ap AfM? MAE, Ajuell?- 


a 
The sum is taken over j1,.-..,j4m- If we use (18.6.5) to estimate successively the 
sum over jam,---5Jj2, then only the sum over j; is left over and we see that 


(71-" <NM™, 
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Taking 2m" roots and letting m— oo yields || T|| < M7, hence 
[Yau(A,u,Aju)| = |(Tuw)| SM? lull, we Hy, 
and taking the supremum over all choices of («,,). we obtain 
\(Axu,Aju)| SM? |lul!?,  u ey. 
Kel 


ds 


If J CJ is finite and AV) = >0,¢, A; it follows that ||AVJ)u|| S M|lu||, and if 


J' > J is another finite subset then 
(AU) - AG ul? S SD Aju, Aew)]. 
jkel\s 


Since }7 |(Aju,Axu)| is convergent it follows that Su = }°,<, Aju converges in 
Hp, and that ||Su|| < M |lul]. 


To prove the last statement it suffices to show that if || A;|| 2 c > 0, jeJ, then 
> jes 4; is not compact for some J C I. To do so we shal! choose j,, j2,...€1 so 
that ),4,l14;,A4%, || $1/u for every uw. Assume that j;,...,jy-, have already 
been chosen so that this is true with strict inequality for u<N if v<N in the 
sums. We must then choose jy =j so that 


SALA <I/N, CIA ALI FIA ARI <I/m w<N. 
U<N vAB 


By the second part of (18.6.5) the first inequality holds except for finitely 
many j, and since ||A, A?| =||A,, Afi] the same is true for the other inequal- 
ities, by the inductive hypothesis. Set J={j,,j,,...}. For every j we choose 
uj € Hp with ||u;|| = 1 and ||AFu;|| 2 ||AF||/2 2 ¢/2 if j € J. Then vj = Aj uy 
converges weakly to 0 in H. In fact, 


* 
j u 


(Af u;, Af v)=(A, AF u;,v) 70 as joo 

so (Afu;,w)—+0 for every w in the closed linear hull of the ranges of the 
operators Af. Since A,w=0 for every k if w is in the orthogonal comple- 
ment, we have (Afu;,w)=0 for all j then. Thus v, converges weakly to 0. If 


S, is compact, it follows that ||S,v;|| +0 as j-+ 00. We have with j,keJ 


S,vj=A,Afuyt ¥ AAS uy 
i 


The norm of the sum is at most 1/y if j = j,. Hence ||A; Aj uj|| > 0, so 


|A¥ uj)? =(A, AP uj,u,) 70. 


ICU TP 
This is a contradiction which completes the proof. 
Proof of Theorem 18.6.3. First we prove the necessity so we assume that 
a”(x, D) is L? continuous for all aeS(m, g). Since the map 


SF’ 3ar+{a”(x, D) u,v) 
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is continuous for all u,ve SF it follows that the map 
S(m, g)3ata”(x, De L(V, I?) 


given by the hypothesis is closed. Hence it is continuous by the closed 
graph theorem. Let w, be a sequence going to oo in W. We shall prove that 
the sequence m(w,) has a bounded subsequence. By Lemma 18.6.4 we can 
take a linear symplectic transformation y, from IR", with the usual sym- 
plectic structure, to W such that g,, (x,(y, ie is reduced to the diagonal form 


LF + Aju; 
Since g<g’, all 4,, are bounded by 1. Thus we may assume that their limits 
exist as v—> oo. Let pe CX(IR?”) and set 
(18.6.6) e,(x, 6) = P(x, A, C=, (w, + X,(% 6), 
where 1,€=(A,,6,,..-,4,, &,)- If the support of ¢ is small enough it follows 
that m(w,)b, is a bounded sequence in S(m, g). The norm of bY(x, D) is equal 
to the norm of e”(x,D) since these operators are unitarily equivalent by 
Theorem 18.5.9. Now 
ev (x, D)u=(2m)-" ff h(x + y)/2,A, ei” u(y) dy dg 
has a non-zero limit as v— 00, for a suitable choice of ueC?. It follows that 
lle?'(x, D) ull is bounded from below for large v. Hence 
ilm(w,) bY (x, D)|| > cm(w,) 


which proves that m(w,) must be bounded. This ends the proof of the 
necessity. 

To prove the sufficiency we assume now that m=1. Writing a=)\a, as 
in the proof of Theorem 18.6.2 we have, again by Lemma 18.6.1, 

lay (x, D)|| SC. 
To prove that (18.6.5) is valid for A,=a(x,D) we must consider the com- 
positions 
ay, =a; ay, by, =ay ay. 
Of course it suffices to discuss a,,. By (18.5.6) we have 
a, ,(X, 4) = ete eae 2 G,(x, é) a,(y, Mix,2= (y,n)* 


We apply (18.4.16) with d, replaced by d|, defined by substituting g4 for g4 
in the definition (18.4.15) of d,. This is legitimate since 1+d,,<C’(1+d,)" by 
(18.4.13)". Hence we have for any positive integer k 
\a,,(w)] SC,(1 + M(w))~*, 
where 
M(w)=min g°,(w—w’)+ min gv,(w—w’). 
Uy Up 
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It is clear that 

8 (w' —w") $2 M(w) 
if w’eU, and w’eU, are chosen so that the minimum in the definition of M 
is attained. We also have 


(18.6.7) gh, (w —w")SCat(w —w") SC, gh (w —w")(1 + 95,(w—w))" 
SC,(1+M(w))**4, 
(18.68) 1+g,,(w—w,)SC(1+g,y-(w—w))SC,(1+.9%,(w—w’)) 
SC,(14+2%(w—w)**1<5C,(1+ Mw) tt. 
With the notation 


a 4 o ’ “uw 
a, = min 5, (W —w’”) 
w’eU,y, weUy, 


it follows from (18.6.7) with new constants C and N that for all w 
1+d,,<C(l1+M(w))’. 


va 
Taking also (18.6.8) into account we conclude that for any k 
(18.6.9) la, (WSC, +d,,)“*(L+9,,(w—w,))* 


The same estimate is valid for any seminorm in S(1,g,,). In fact, if g, (t)<1 
and we apply the differential operator <D,t> to a,,, we obtain one term 
where the differentiation falls on a,, which does not affect the estimate, and 
one where it falls on a,. This may lead to a loss of a factor (8,,(0)/80, (t))? in 
the estimates. Since it is bounded by some power of 1+d,,,, our assertion is 
proved. Hence Lemma 18.6.1 combined with the remark following it gives 


lay a(x, D)|\ ES Cy(h +d,,)-™ 
for any N. We shall prove in a moment that for some C and N 
(18.6.10) Vad4+d,) %*sC, Yt+a,) %sc. 
v HB 
From Lemma 18.6.5 it follows then that ||a”(x, D)|| <C’. 


It remains to prove (18.6.10), which is closely related to Lemma 18.4.8. 
Since for w’eU, and w”eU, we have 
1+gi (Ww —w)SC(1+g%.(w —w)S CL + 9%,.(w' —w"))"" 
SC"(1 +3, (Ww —w', 
it follows that 
1+d,,sC'(i+d,,)". 
Hence it suffices to consider the sum in (18.6.10) with respect to uw. We 
choose g,, orthonormal coordinates with 0 at w,. Then gf, is at least as 
large as the Euclidean metric g,, . If d,,<k then there is some point w’eU, 
with Euclidean distance at most Ck? from 0. The metric at w” can be 
estimated by a power of k times the Euclidean metric, so a ball of radius 
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k-™ with center at w” is contained in Uj. As in the proof of Lemma 18.4.8 it 
follows at once that the number of indices u with d,,<k is bounded by a 
power of k, which proves (18.6.10). 


Theorem 18.6.6. Assume that g<g’, that g is o temperate, and that m is o, g 
temperate. Then the operators a”(x,D) with aeS(m,g) are all compact in L? if 
and only if m0 at oo. 


Proof. With the notation used in the proof of Theorem 18.6.3 it is clear that 
a” is compact for every v. If m-+0 at.co we can for every e>0 choose a 


v 


finite set N(e) such that 
(a— Y a,)/e 


yeN({e) 


is bounded in S(1,g) when ¢—>0. Thus we have by Theorem 18.6.3 
lla"(x,D)— J) ay(x, D)|| SCe, 


veN(e) 
which proves that a”(x,D) is compact. Assume on the other hand that all 
a”’(x, D) with aeS(m,g) are compact. Then it follows from Theorem 18.6.3 
that m is bounded. If b, is defined as in the proof of necessity in Theo- 
rem 18.6.3 and m,=m/(w,) then 
dm, b,eS(m, g) 


ved 


for every subset J of the index set. Hence 


> m, bY(x, D) 

vet 
is compact by hypothesis, and b” satisfy (18.6.5) by the proof of sufficiency 
in Theorem 18.6.3, so it follows from Lemma 18.6.5 that m, ||bY(x, D)|| +0. 
Since ||bY(x,D)|| is bounded from below we conclude that m,->0 as v—> oo, 
which implies that m(x, €) +0 as (x, £)- 00. The proof is complete. 


So far we have only considered scalar pseudo-differential operators. 
However, it is clear that the calculus developed in Section18.5 is not 
changed at all if one allows the functions u to take their values in a Banach 
space B, and the symbol a to take its values in #(B,,B,), so that a”(x, D)u 
takes its values in B,. However, the discussion of L? estimates here depends 
on Lemma 18.6.5 where the Hilbert space structure is vital. Theorems 18.6.3 
and 18.6.6 are therefore applicable only when B, and B, are Hilbert spaces 
which in the second case must also be finite dimensional. 

As a first application we shall prove a general version of Theo- 
rem 18.1.14. Note that with the notation there the Weyl symbol of (a(x, D) 
+a(x, D)*)/2 is Re(a+i/2¥ 0? a/6x,0€,) modulo S”~?. We only consider the 
case of Theorem 18.1.14 where m=0, for spaces corresponding to H,, have 
not been defined here in the general context. 
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Theorem 18.6.7. Let g be o temperate, 


(18.6.11) h(x, é =supg,, ./s eS, 
and assume that 0<aeS(1/h, g). Then it follows that 
(18.6.12) (a”(x, D)u,u)=—C|lull?, ue, 


with scalar product and norm in L?(IR"). 


Proof. For the metric 
G=(a+1)-!h-'g 
we have 
sup G,,/G%,=(a(w)+ 1)~? h(w)~? sup g,,/2, =(a(w)+1)77 <1. 


If we show that G is slowly varying it will follow from Proposition 18.5.6 
that G is o temperate, for h(a+1) may be replaced by min (1,h(a+1)) since 
h(a+1) is bounded. Thus a+1 is o, G temperate. We shall also prove that 


(18.6.13) a+1leS(a+1,G), 
that is, 
k 
(18.6.13!  fa(w3t,,...,t)1S C,(a(w) +1)! -"/? h(w)-#? [1g.(t)*- 
1 
When k2=2 this follows from the hypothesis aeS(1/h, g), so we only have to 
show that 
(18.6.13)" |h(w)a'(w)t| S C(h(w) a(w))? g,,(0)?. 


If we introduce g,, orthonormal coordinates z,,...,2,, with the origin at w 
and regard h(w)a as a function F(z), then 


IDE F(Z\ISC,,  |zi<e, 
for all « since aeS(1/h, g), and F 20. Thus it follows from Lemma 7.7.2 that 
|F’(0)| < CF(0)* 
which proves (18.6.13). By Taylor’s formula we obtain 
F(z)+h(w)2(FO)+h(w))/2 if |2|?<c,(FO)+A(w)), 
that is, 
2h(w)(a(w,)+ HZh(wy(a(w)41) if G,(w—wy)<cy. 


Thus G,, <CG,, then so G is slowly varying. 
If F is a C™ function with 


|i FY(QI<C,F(), t>0, 


and meéS(m,G), it is immediately verified that F(m)eS(F(m), G). (Cf. Lem- 
ma 18.4.3.) Taking F(t)=t? and m=a+1 we obtain b=(a+1)*eS(b,G). Now 
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Theorem 18.5.4 gives 
a” (x, D)+ 1 =b™(x, D)? +c(x, D) 


where ceS((a+ 1)~', G)cS(1, G). Hence c”(x, D) is bounded which proves the 
theorem since b”(x, D)? is positive. 

In the last part of the proof there was so much information given away 
that one might suspect that a better result is valid. Indeed, we shall now 
prove the stronger Fefferman-Phong inequality: 


Theorem 18.6.8. If g is o temperate and (18.6.11) is fulfilled then (18.6.12) is 
valid for every aeS(1/h*, g) with a=0. 


A crucial point in the proof of Theorem 18.6.7 was the application of 
Lemma 7.7.2. We shall now prove a similar result which takes derivatives of 
order <4 into account. By B, we shall denote the ball {xeIR”;|x|<r} where 
|x|? =e is the Euclidean metric form. 


Lemma 18.6.9. Let O< feC™(B,) and assume that 


(18.6.14) \flg@)s1, xeB,, 
(18.6.15) max (| f(0)|,1/13(0))=1. 

Then we can find r>0 independent of f such that 
(18.6.16) z<max(|f(x),|f15(x))<2,  xeB,, 
(18.6.17) If\G§@)<8 ff i<4, xeB,, 
(18.6.18) SW=f,od+ge(x), xeB,, 


where f,,g€C™(B,) and f, 20, <y,0>f,(x)=0 when xeB, for some yeR’ 0. 
The supremum of |D* f,| and of \D* g| in B, can be estimated by the supremum 
of |D* f| in B, for |p| S2+|al. 


Proof. We shall first estimate |f|{(0) and |f|3(0). Let f;(x) be the Taylor 
polynomial of order j at 0. Then 
OSS (x)S14 fix) +|x17/24 fa(—)+|x17/24, [x1 <2, 
by Taylor’s formula. Hence 
fC) + fal S1+1x17/2+1x14/24, |x] S2. 
If |x| =1 we obtain if x is also replaced by 2x 
fix) + fa S3/2+ 1/24, |2fi(x) + 8fa(~)| $3 + 2/3, 


which implies 
61 fi S16,  6/ f3(x)| $7. 


Thus |/{{(0)<3 and |f|§(0)S7. From Taylor’s formula it follows now that 
(8.6.16) and (8.6.17) hold if r is small enough. 
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First assume that |f|§(0)=1 and that f(0) is small. By Lemma7.7.2 we 
have [rf’ (0)? <2f(0) if r<4, so f’(0) is also small. The quadratic form f,(x) 
has an eigenvector y with eigenvalue +4. After an orthogonal transfor- 
mation we may assume that y=e, =(1,0,...,0). Thus 67f(0)/0x?= +1 and 
0? f (0)/0x, @x;=0 for j +1. Since 

OSS (x1,0,...,0)S f(0) +x, Af (O)/Ax, +x2/24+7 |x, 17/6 + x4/24 

we conclude by taking x,=+4 that 67f(0)/Ox}=+1 if f(0)<1/100, for 
1/100 —1/18 + 7/162 +3-*/24<0. If |x,|=r and |x’|<r, x’=(x,,...,x,), then 
the estimate 

|Af (x)/0x, —6f O)/Ox, —x,|<4|x|? +|x|°/6 
implies that 0f/dx, has the sign of x, if 

r>|éf(0)/dx,|+4r? +r3/6. 
We fix r so small that 4r+r?/6<1/2 and 1/2<6?f(x)/0x?<2 when xeB,,. 
If f(0)<r*/8 then |f’(0)|<7r/2 so we conclude that the equation 
(18.6.19) 6, f(x,,x')=0 
has a unique solution x, = X(x’) with |x,|<r if |x’|<r. Since 
dX /6x;= —0, 0S (X, x’ WO7s (X,x), 

we obtain successively bounds for all derivatives of X of order k from 


bounds for the derivatives of f of order k+1, for 0?f(X,x’)21/2. By 
Taylor’s formula and (18.6.19) we obtain f =f, +g” in B, where 


AiO) = F(X), x) 


is independent of x,, and 
a(x)? =f (x)—f (X(%), x) =(x, — Xx’)? QC), 
1 
O(x)= f Of (x, +t(X(x')—x,), x’)edt. 
0 


We have Q21/4, and the derivatives of Q of order k can be estimated in 
terms of the derivatives of f of order k+2, so g=(x, —X(x’))QteC™(B,) has 
the required properties. 

It remains to examine the case where f(0)2r*/8. Then we can find ry<r 
so that 


f(x)>r*/9 if |xl<7r. 


If we replace r by ro in the lemma then g(x)=f(x)? has the required 
properties. The proof is complete. 


The following lemma is the special case of Theorem 18.6.8 where g is a 
constant metric; it is the essential step in the proof. 
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Lemma 18.6.10. Let g be a positive definite quadratic form in IR?" with 
g/g? <A72 <1. Let OS aeC™(R"), and assume that 


(18.6.20) lali(w)SA-7, if weIR?” and kSN. 
If N is large enough it follows that 
(18.6.21) (a”(x,D)u,u)2—C lull’, ue, 


where C is independent of g and of a. 


Proof. In the proof we may assume that a¢.¥(IR2"). By Lemma 18.6.4 and 
Theorem 18.5.9 it is no restriction to assume that 


g(x, 6)=)Aj(x5 + F)). 
Here 4,</. The hypotheses in the lemma therefore remain valid if we 
replace all A, by A, and (18.6.20) can then be written 
(18.6.20) jalé(w) SA"? OkKSN,  welR?". 


Here e is again the Euclidean metric form. We may also assume that the 
lemma has already been proved for lower dimensions than 2n. This implies 
that (18.6.21) is valid if a is independent of €,, say, for a”(x, D) can then be 
regarded as an operator in the variables x’=(x,,...,x,) depending on x, as 
a parameter, and it has a lower bound —C for every x,. Using Theo- 
rem 18.5.9 we conclude that this is also true if a is constant in some other 
direction. This will allow us to handle the term f, in Lemma 18.6.9. 

As in the proof of Theorem 18.6.7 we shall change the metric Ae to a 
metric G with G, .= H(x,¢)e such that H<1 and aeS(1/H*,G). This re- 
quires in particular that 

asH-*, |al§sH™'. 
To make sure that these conditions are satisfied we define 
(18.6.22) 1/H(w)= max (1, a(w)}, |al$(w)). 
Now we apply Lemma 18.6.9 to 
f(2)=HWwy a(w+2/H(w)}), — zelR?". 
From (18.6.20) we obtain |f/§ <1, and we have 


fO)=Hw) aw), [F156 )= Hw) |als(w). 


These quantities are <1 by (18.6.22). We can always apply Lemma 18.6.9 to 
F(z)+1-—f(0). From (18.6.16) we obtain 


1/H(w)S2/H(w,) if [w—w,|? H(w) <r’, 
that is, G,,$2G, if G,(w—w,) <r. 


Hence G is slowly varying so it follows from Proposition 18.5.6 that G is o 
temperate. The proof in the present situation is so simple. that we repeat it 
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to emphasize the uniformity. We must show that 


G,,, SCG, (1+ Gf, (w—w,)) 
or equivalently that 


H(w,)S CH(w)(1 +|w—w,|?/H(,)). 


In doing so we may assume that G,(w—w,)2c, that is, that H(w)|w 
—w,|? 2c, and then the estimate holds with C=1/c. 

Choose ye CP {z;|z|<r} real valued and equal to 1 in {z;{z|<r/2}. Then 
we have uniform bounds for y(z)f(z) and its derivatives of order <N. This 
follows from Lemma 18.6.9 and the fact that (18.6.20) gives estimates for the 
derivatives of f of order 24. By an obvious modification of Lemma 18.4.4 
we can find a sequence w,eIR*" such that there is a fixed bound for the 
number of overlapping balls B,={w;G,, (w—w,)<r7} and for suitable real 
valued $,¢CP(B'), B= {w; G,, (w—w,)<r?/4}, we have |9,|7 S$ C, and 


Lowy =1. 


Set 
a,(w) = x(H(w,)*(w—w,))? a(w). 
Then we have 
Vow? a(w)=a(w);  ayeCe(B,); |al¥SC,H-*, kN. 
We shall prove in a moment that there is a constant C, such that 
(18.6.23) (a¥(x, D)uu)2—C,(u,u), ueF. 
Admitting this we replace u by @(x,D)u and sum. We can consider 
{@,(x, €)} as a symbol in S(i,G) with values in ? ~ Y(C, i’), for at any point 
there are only a fixed number of terms to consider. Hence it follows from 
Theorem 18.6.3 that the operator 
®: utr {b"(x, D)u}eL7(IR", 1?) 
is I? continuous, that is, 
(18.6.24) d lor x, D) ul]? SC; lull. 


The sum )’ $¥(x, D)a¥(x, D) ¥(x, D) can be regarded as the composition of 
®, the operator A with the diagonal matrix symbol {a,(x, ) 0.3 in 
F(R", I’), and the adjoint of ©. We have control of as many seminorms of 
the symbol of A in S(H~?,G) as we like. Since ¢, {a,, ,} + {¢,,4,} 6, =0 the 
first order terms in the composition formula in Theorem 18.5.4 cancel. 


Hence 
DY OF (x, D) a (x, D) ¥ (x, D)=a"(x, D) + R” (x, D), 


where any desired number of seminorms of R in S(1,G) are bounded. By 
Theorem 18.6.3 and (18.6.23), (18.6.24) it follows that 


(a(x, D)u, u)= =o (x, D) 6” (x, D)u, 6” (x, D)u)—(R” (x, D)u, u) 


— Cz Cs |jull?— C, lull? 
as claimed. 
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It remains to verify (18.6.23). In Lemma 18.6.9 we can write 


x2Y f(2)=x(2Y + (a2) a2)" 
where f,20 is obtained by multiplying f, with a cutoff function which is 
constant in the direction y and 1 in supp y. Then we have bounds for the 
derivatives of f, of order <N—2. Going back to the original variables we 
set H,= H(w,) and 
=U. —w,) H}, by = f,((. —w,) HD/H,, 
c,=(18)((. —w,) H3)/H, 

where f, and g are obtained from Lemma 18.6.9 applied to H? a(w,+./H?). 


By (18.6.18) we have 
a= b+, 

and we have bounds for any desired number of seminorms of b,, y, and c, 
in S(H,?, H,e), S(1,H,e) and S(H;>1, H,e). By the inductive hypothesis 

(bY (x, D) u, u) = — C; |||. 
From Theorem 18.5.4 we obtain as above a bound for any number of 
seminorms of the symbol of a*(x, D)—y"(x, D)b¥(x, D) x" (x, D) 
—cy (x, D)cl(x,D) in S(,H,e) if N is. large, so Theorem 18.6.3 gives a 
bound for the norm of this difference. Thus (18.6.23) and the lemma are 
proved. 


Proof of Theorem 18.6.8. We just have to repeat a part of the proof of 
Lemma 18.6.10. Choose ¢, as in Lemma 18.4.4 but so that ) ¢?=1. We can 
arrange so that ¢,w~,=¢, for some non-negative y,¢C?(B,) which are also 
bounded in S(1,g). Set a,=w,a, which implies that a=) ¢,a,@,. Hence we 
obtain as before that 


Dy (x, D) ay (x, D) by (x, D)—a"(x, D) 
is a bounded operator. By Lemma 18.6.10 we have 
(a}’(x, D)u, u) = —C |||? 
where C does not depend on ¥, so it follows that 
© (a}(x, D) b} (x, D) u, $2 (x, D) u)= — C' |u|? 
since (18.6.24) remains valid for our new choice of ¢,. This completes the 


proof. 


Specializing to the metric (18.4.1)’ we have proved: 


Corollary 18.6.11. If 0<aeS?4-°(R" xR") and OS5<pX1 then a”(x, D) is 
bounded from below and so is a(x, D)+ a(x, D)*. 


It is not possible to replace 4~? in Lemma 18.6.10 by a larger power: 
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Example 18.6.12. Set b(x, =x é, (x, 2)eR?. Then 

b” (x, D)? =a”(x, D) +4 
where a=b’, for —-8~'(0,0,-0,0,)° x €yn=4. Thus 

(a”(x, D)u, u)= ||bY (x, D) ull? — ||u/2||?, ue S. 
The equation b”(x, D)u=0 can be written 2xu’+u=O so it is satisfied by 
u(x)=x—?, Let y€C® be equal to 1 in a neighborhood of 0 and set for small 
e>0 
u,(x) = (x(@ x) — x(x))/xI?. 
Then it follows that 
b"(x, Dy u,(x) =|x|~# x D(x(ex) — x(x), 


so ||b”(x, D)u,|/?<C but jlu,|/?Aoge— —2 as e>0. Thus (a”(x, D)u,,u,)<0 if 
é is small enough. Now choose a,¢C?(IR’) so that ay 20 and a(x, £)= x? £? 
in a neighborhood of 0. When 4-0 we obtain 


(a%(V/'4x, V/A D)u,, u,)/2? (a(x, D) u,, u,), 


for a)(VAx,V/46)/22 converges weakly to a in S((1+x?+7)?,e) when 4-0. 
Since the right-hand side can be negative this proves that we cannot 
improve the exponent — 2 in (18.6.20) or in Theorem 18.6.8. 


However, the condition a=0 in Theorem 18.6.8 and Lemma 18.6.10 is 
too strong. The reason is that in the proof of Lemma 18.6.10 we discarded 
quadratic terms which can give essential contributions when combined with 
other such terms in a later stage of the inductive proof. We shall return to 
this matter in Section 22.3. 

Theorem 18.6.7 remains valid in the vector valued case but cannot be 
improved then. The key to the proof of the positive result is the following 
rather weak analogue of Lemma 18.6.10. 


Lemma 18.6.13. Let g be a positive definite quadratic form in R?" with 
g/e°S/7? <1. Let O<aeC™(R*", L(H,H)) where H is a Hilbert space, and 
assume that 


(18.6.25) lali(w)SA-* if welIR?" and kSN. 

If N is large enough it follows that 

(18.6.26) (a”(x, D)u,u)=—C |lull?,  ueS(R", A). 

Proof. As in the proof of Lemma 18.6.10 we may assume that g=Ae where e 
is the Euclidean metric form. Then 

(18.6.25)’ jal?(w)SA®- 20/2, 
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Let Ao+A,(x, €) be the first order Taylor expansion of a at 0. By (18.6.25)' 
we have for all veH 


(Ao 0, v) + (A, (x, 0) 0, v) + |x]? [loll?/220, 

(Ao, v)+(A,(0, &)v, v) +]€]? loll?/220. 
If ue SYR", H) we obtain by aopivns these inequalities with v=u(x) or v 
=1(€) respectively, with x and € replaced by 2x and 26, 
(18.6.7) (Agu) + (Ay (x, D) uu) = — D(x jull? + |D,ull?), 


where the scalar products are now in L7(IR",H). (This is a substitute for 
Lemma 7.7.2.) Next we prove that 


(18.6.28) (a”(x,D)u,u)= —C)'(\|x;ull? + |D;ull). 
To do so we use Taylor’s formula to write 
a(x, = Ag + Ay (x, E+) xX, Rul V+ DY Fj 64 Sul 2) 
+29) x56, Tal, &), 


where R,, and S,, are symmetric in j and k and we control any number of 
seminorms of R,, S, and T,, in S(1,g). Then we have 


a” (x, D)=Agt+ A, (x, D)+>)x; RR, ole S¥,(x, D) D, 
+). x, T(x, D)D, +YD, TY (x, D) x ;— R” (x, D) 

where 

4R=)P 0? Ry JOE; OE, + YO? S/0x,0x,—2¥ 0? Ty,/O€ ; OX, 
has bounded seminorms in S(A,g). This follows from the fact that left (right) 
mulepication by x, or D, means that the symbol is multiplied by x, 
+5i0/0E, or €; F4id/6x,, after a short calculation. The estimate (18.6.28) i is 
an Finmiediate consequence since ||u||? <||x,ul]? + ||D, ull’. 

From (18.6.28) it follows that more generally 
(18.6.28) — (a*(x, D)u,u)= — CY (I(x; —y ull? + lDj—n ull?) 
for arbitrary (y,4)eIR2". We just have to apply (18.6.28) to a(x+y,é+m) with 
u transformed according to Theorem 18.5.9 to verify this. Now choose 
geCP(R") so that 
LV oe-vP= 


when v runs over the lattice points. Set 6,(x)=(x —v) and apply (18.6.28)' 
with y=v and u replaced by ¢,u. Then 
DVil@;—v) eu? SC lal? 


We can use the calculus with the metric |dx|?+A|dé|? to compute 


doy (x, D).a” (x, D) bY (x, D). 
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The main term in the sum is a and the first order terms cancel, so we 
conclude as in the proof of Lemma 18.6.10 that the symbol is a+b where we 
have bounds for a large number of seminorms of b in S(1,|dx|?+A|d€|°), 
hence also a bound for the norm. It follows that 


(18.6.28)” (a(x, D) u,u)= — C(\ull2+¥, |(D,—n) ull). 


Here we replace u by ¢”(D)u and y by v. Repetition of the preceding 
argument then gives (18.6.26). The proof of the lemma is complete. 


From the lemma we obtain at once using the localization argument in 
the proof of Theorem 18.6.8: 


Theorem 18.6.14. Let g be a o temperate metric and assume that (18.6.11) 
holds. If aeS(i/h,g) takes non-negative values in &(H,H) where H is a 
Hilbert space, it follows then that 


(a(x, D)u,u)=—C |\ul?, we A(R", H). 


Notes 


Pseudo-differential operators have developed from the theory of singular 
integral operators; these are essentially pseudo-differential operators with 
homogeneous symbol of order 0. In the theory of singular integral operators 
only the principal symbol is studied. Its multiplicative properties appear 
somewhat mysteriously since the Fourier representation is avoided, and this 
seems to be the historical reason for the term. (See Seeley [5].) Singular 
integral operators were introduced in the study of elliptic problems, but it 
was realized in the 1950’s that they are not really essential then. The work 
by Calderon [1] on the uniqueness of the Cauchy problem gave another 
testimony to their importance, but in the predecessor of this book his 
results were proved and extended by direct methods based on partial 
integration, Fourier transforms and localization techniques. It seems likely 
that it was the solution by Atiyah and Singer [1] of the index problem for 
elliptic operators which led to the revitalization of the theory of singular 
integral operators. Anyway, shortly afterwards Kohn and Nirenberg [1] 
introduced pseudo-differential operators with general polyhomogeneous 
symbols. Thus they removed the artificial restriction to order 0 and gave 
rules of computation for terms of lower order, which made the new tech- 
niques highly competitive. Their proofs and definitions relied on the Fourier 
representation except for the change of variables which was based on a return 
to singular integral operators. This remnant was removed shortly afterwards 
by H6rmander [16]. The need to incorporate fundamental solutions of hy- 
poelliptic operators of constant strength led to the introduction of symbols 
of type p, 6 in Hérmander [18], by many considered as an excessive 
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generalization. However, these were in fact inadequate for the study of 
differential operators of principal type, which led Beals and Fefferman [1], 
Beals [1] to such a great extension of the class of symbols allowed that one 
can adapt the symbois to the operator being studied. An extension of their 
techniques, adapted to the Weyl calculus which has a tradition in quantum 
mechanics going back to Weyl [5] is presented in Sections 18.4 through 
18.6. We have mainly followed Hérmander [39] but some extensions are 
adopted from Dencker [2]. One of the applications of the general calculus 
given by Beals and Fefferman [1] was the “sharp Garding inequality” first 
proved in Hérmander [17] and later extended to the vector valued case by 
Lax and Nirenberg [1]. In Section 18.1 we give an elementary proof of the 
original result which will be needed frequently in the later chapters. A 
generalization of the Lax-Nirenberg result as well as a much more precise 
statement in the scalar case due to Fefferman and Phong [1] are given in 
Section 18.6. For improvements in another direction we refer to Section 22.3 
and the notes to Chapter XXII. The key to the estimates in Section 18.6 is a 
result on sums of almost orthogonal operators often referred to as Cotlar’s 
lemma which was proved by Cotlar [1] and by Knapp and Stein [1]; the 
more general statement given here is due to Calderén and Vaillancourt [1, 
2]. 

The conormal distributions discussed in Section18.2 were defined in 
Hormander [26]. They constitute the simplest case of Lagrangian (Fourier 
integral) distributions which will be discussed fully in Chapter XXV. The 
results on the transmission condition are due to Boutet de Monvel [1]; the 
extended version in Theorem 18.2.18 comes from old lecture notes inspired 
by the work of Visik and ESkin [1-5]. Section 18.3 is almost entirely due to 
Melrose [1]. We refer to his paper for further developments of the theme 
parallel to the theory of Fourier integral operators in Chapter XXV. 


Chapter XIX. Elliptic Operators 
on a Compact Manifold Without Boundary 


Summary 


For an elliptic pseudo-differential operator on a compact manifold it follows 
immediately from the calculus of such operators that the kernel and the 
cokernel are both finite dimensional. Thus elliptic operators are Fredholm 
operators. The main topic of this chapter is the study of the index, that is, 
the difference between the dimensions of the kernel and the cokernel. This is 
an interesting quantity to study because it is very stable under pertur- 
bations; for many operators which occur in geometry the index gives 
important information on the topology. Thus the classical Riemann-Roch 
theorem, as well as some of its modern analogues for several complex 
variables, is a case of the index theorem. 

..&We start in Section 19.1 by reviewing abstract Fredholm theory. In doing 
so we add some points which are not quite standard. These concern the 
stability of the index of strongly continuous families of operators, the 
expression of the index by means of traces in the case of operators in 
Hilbert space, and finally related results on invariance of Euler characteris- 
tics under passage to homology. The main analytical properties of the index 
of elliptic pseudo-differential operators are then established in Section 19.2. 
For pseudo-differential operators in IR” an explicit index formula is proved 
in Section 19.3 by means of evaluation of certain traces. As indicated at the 
end of Section 19.2 the results of Sections 19.2 and 19.3 can be used to 
derive the Atiyah-Singer index formulas. However, this is mainly a problem 
in differential geometry, and we do not wish to develop the necessary 
prerequisites on characteristic classes to discuss it so the remarks will be 
quite brief. Instead we pass in Section19.4 to the Lefschetz formula of 
Atiyah and Bott which can be stated and proved without any extensive 
background in geometry. A discussion of the extent to which ellipticity is a 
necessary condition for the Fredholm property is given in Section 19.5; a 
more general notion of ellipticity is introduced in this context. 


19.1. Abstract Fredholm Theory 


If B, and B, are finite dimensional vector spaces (over €) and T is a linear 
transformation B, > B, then the rank of T is the codimension of the kernel 
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Ker T or equivalently the dimension of the range, that is, 
dim B, — dim Ker T=dim B, ~dim Coker T 
where Coker T= B,/TB,. This means that 
dim Ker T—dim Coker T=dim B, —dim B, 


is independent of T. This is the basic reason for the stability properties of 
the left-hand side in the infinite dimensional case; it is called the index of T. 
If B, and B, are Banach spaces and Te Y(B,, B,) then 


Ker T={feB,; Tf =0} 


is a closed subspace of B, but need not be finite dimensional. The range 
TB, need not be closed, but we have 


Lemma 19.1.1. If Te Y(B,,B,) and the range TB, has finite codimension in 
B, then TB, is closed. 


Proof. We may assume that T is injective for otherwise we can consider 
instead the map from B,/Ker T to B, induced by T. If n is the codimension 
of TB, we can choose a linear map 


S: (’>B, 

such that SC" is a supplement of TB,, that is, the map 

T,: B,OBC’3a(x, yo Tx+S yeB, 
is bijective. By Banach’s theorem it is then a homeomorphism, which proves 
that TB, =T,(B, @ {0}) is closed. 
Definition 19.1.2. Te Y(B,,B,) is called a Fredholm operator if dim Ker T is 
finite and TB, (is closed and) has finite codimension; one then defines 
(19.1.1) ind T=dim Ker T — dim Coker T. 


Sometimes we shall also use the definition (19.1.1) when only dim Ker T 
is finite and TB, is closed. This situation can be characterized as follows: 


Proposition 19.1.3. If Te #(B,,B,) the following conditions are equivalent: 

(i} dim Ker T< 00 and TB, is closed. 

(ii) Every sequence fj¢B, such that Tf, is convergent and f; is bounded has 
a convergent subsequence. 


For the proof we need a classical lemma of F. Riesz: 


Lemma 19.1.4. The unit ball in a Banach space B of infinite dimension is 
never compact (in the norm topology). 
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Proof. We can choose a sequence f,¢B such that 
IfM=1S64+¥ afl 
k<j 


for all j and all a,e@. In fact, if f,,...,f;_, have already been chosen we can 
take g outside the linear space L spanned by these elements. There is a 
point heL which minimizes ||g—hj|| for L is finite dimensional and the norm 
+ if hoo. Then f,=(g—h)/|[g—h|| has the required properties. In partic- 
ular we have | f,—f,||21 for j+k so there is no convergent subsequence. 


Proof of Proposition 19.1.3. If (ii) is fulfilled it follows in particular that the 
unit ball in KerT is compact, so KerT is finite dimensional by Lem- 
ma 19.1.4. By the Hahn-Banach theorem the finite dimensional space Ker T 
has a topological supplement B,<B,. It also follows from condition (ii) 
that 


(19.1.2) IfI,SCITSil., feBo. 


In fact, otherwise there is a sequence f;eB, with | fj|,=1 and | Tf|,<1/. 
By condition (ii) a subsequence has a limit f such that feBo, ||f||=1, Tf =0 
which is a contradiction proving (19.1.2). Conversely, assume that (19.1.2) is 
valid and that f; is a bounded sequence in B, such that Tf; is convergent. 
We can write f,=g,+h, where g,eKer T and h,;¢By are also bounded. Since 
Th,= Tf; it follows from (19.1.2) that the sequence h, is convergent, and the 
bounded sequence g, in the finite dimensional space Ker T has a convergent 
subsequence. Thus (ii) holds. Now it follows from Banach’s theorem that 
(19.1.2) holds if and only if T restricted to Bo is injective with closed range. 

This proves the equivalence of (i) and (ii). 


Our first stability result on the index is the following 


Theorem 19.1.5. If T satisfies the conditions in Proposition 19.1.3. and 
Se Y(B,,B,) has sufficiently small norm then dim Ker(T +S)Sdim Ker T 
T+S has closed range, and ind(T + S)=ind T. 


Proof.. Assume first that T is bijective. Then 
T+S=T(UI+T™'S) 
is bijective if |T~‘|| |S||<1, for I1+T-*S can then be inverted by the 


Neuman series. Thus the theorem is valid then. Next assume only that T is 
injective. Then we have by (19.1.2) 


Iflis SCF 12S CMT+S SF ,+ CSI NF. . 


Hence ||f],S2C\\(T+S)f\|, if C\|Sii<4, which means that T+S is in- 
jective with closed range then. If dim Coker T < 00 we can choose a finite 
dimensional supplementary. space W of TB,. Let q be the natural map 
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B,—B,/W. Then qT is bijective so qT+qS is bijective if ||S\| is small 
enough, thus 

ind(T+S)= —dim W=ind T 
then. If we just know that dim Coker T > v we can choose W of dimension v+1 
with W TB, = {0} and obtain ind(T +S) < —v if ||S|| is small enough. If we 
apply this with T replaced by T +S, we see that the sets 


{S; C ||S|| < 4,ind(T+S)=—v}, {S; C ||S}] <4, ind(T +S) < -v} 
are open for every v. Because {S; C ||S|| < $} is connected it follows that 
ind(T + S) must in fact be constant. 

If N = KerT # {0} we choose a topological supplement Bo C B, of N, thus 
B,; = Bo ON. Then it follows from what we have already proved that T +S 
restricted to Bo is injective with closed range and that 


codim(T +S) B, =codim TB,=codim TB, 
if |S]| is small enough. If N’=Ker(T+S) then N’ B,={0} and B, 


=N’®W@B, for some finite dimensional W. The codimension of (T+S) B, 
=(T+S)W@(T+S)B, is equal to codim TB, —dim W, so 


ind(T+S)=dim N’+dim W —codim TB, 
=dim N —dim Coker T=ind T 
which completes the proof. 


Corollary 19.1.6. The set of Fredholm operators in £(B,,B.) is open, 
dim Ker T is upper semi-continuous, and ind T is constant in each component. 


Corollary 19.1.7. If T,e £(B,,B,) and T,e£(B,,B,) are Fredholm operators 
then T, T,ecL£(B,,B;) is a Fredholm operator and we have “the logarithmic 


law” 
ind(T, T,)=ind T, +ind T,. 


Proof. Since T, maps Ker T, T, into Ker T, with kernel Ker 7, we have 
dim Ker T, 7, Sdim Ker 7, +dim Ker T,. Similarly, dim Coker T,7,S 
dim Coker T, + dim Coker T) so it is clear that T, T, is a Fredholm operator. 
If I, is the identity operator in B, it follows in block matrix notation that 


(7 ) ( I, cost pant) (¢ | 

0 T,/ \-I,sint I,cost/ \0 I, 

is for every t a Fredholm operator from B, @B, to B,®B;. When t=0 it is 
the direct sum of the operators T, and T,, and when t=—7/2 it is the 


operator (f,,f,)h->-(—f,, 7,7, f;). The index is clearly ind T,+ind T, and 
ind(T, T,) in the two cases which proves the corollary. 


Corollary 19.1.8. If Te Y(B,,B,) is a Fredholm operator and Ke £(B,,B,) is 
compact, then T+ K is a Fredholm operator and ind(T + K)=ind T- 
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Proof. If f,; is a bounded sequence in B, such that (T+K)f; is convergent, 
then the compactness allows us to choose a subsequence f,, such that Kf,, 
is convergent. Thus Tf, is convergent, and it follows from condition (ii) in 
Propositon 19.1.3 that the sequence f;, has a convergent subsequence. Thus 
another application of Proposition 19.1.3, now to T+K, shows that T+K 
has finite dimensional kernel and closed range. By Theorem 19.1.5 the index 
of T+2K is therefore a locally constant function of ze, hence independent 
of z. Thus T+2K is always a Fredholm operator with index equal to ind T. 


Corollary 19.1.9. If Te £(B,,B,) and for some S,;eL(B,,B,) we have TS, 
=I,+K,, S;T=1,+K, where K, are compact, then T, S, and S, are 
Fredholm operators and ind T= —indS,, j=1, 2. 


Proof. T is a Fredholm operator since 
dim Ker TS dim Ker(I, + K,), 
dim Coker T < dim Coker (I, + K,). 


Since 
Si TS,=S,+S, K,=S8,+K, S> 


it follows that S,—S, is compact. Hence S,T—I, and TS,—I, are also 
compact so S,; and S, are also Fredholm operators by the first part of the 
proof, and ind T+indS;=ind([;+K,)=0 by Corollary 19.1.7 and Corol- 
lary 19.1.8. The proof is complete. 


The index has much stronger stability properties than stated in Corol- 
lary 19.1.6: 


Theorem 19.1.10. Let B, and B, be two Banach spaces and let I be a compact 
space. If T.eL(B,,B,) and S,¢eL(B,,B,) are strongly continuous as functions 
of tel, and if 

K,,=8,T,-1,, K,,=T,8,—-1, 


are uniformly compact in the sense that for j=1,2 
M;= {K,,f; tel, feB,, If lj;s 1} 


is precompact in B,, then T, and S, are Fredholm operators, dim Ker T, and 
dim Ker S, are upper semi-continuous, and ind T,= —ind S, is locally constant, 
hence constant if I is connected. 


Proof. T, and S, are Fredholm operators by Corollary 19.1.9. Let 


N={(t, fel x Bi, |flli=1l, 1 f=0}. 


This is a closed subset of Ix M,, for T,f=0 implies f = —K,,feM,. Hence 
N is compact. The proof of Lemma 19.1.4 shows that 


I,={tel; dim Ker T, 2k} 
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is the set of all ¢ such that there exist f,,...,f, with (,fjeN and 
lit Dafil21 


i<j 
for j=1,...,k and all a,e€. Thus I, is the projection in J of a compact set in 
N* and therefore compact, which means that {t;dim Ker T,<k} is open. 
Hence dim Ker T; is upper semicontinuous and the same is of course true for 
dim Ker S,. 

Since ind T,+indS,=0 the theorem will be proved if we show that ind T, 
(and therefore indS,) is upper semi-continuous. Let t,¢ and choose a 
topological supplement B)<B, of Ker T,,. Since the injection B,—B, has 
index —dimKerT,, the index of the restriction of T, to By is ind T, 
— dim Ker T,,. We must therefore show that codim T, B,2codim T,, B, when 
t is in some neighborhood of ty. Choose W with dim W=dim Coker T,, and 
Wo T,, B, = {0}. The set 


Nw ={(tf)el x Bo, If =1, Tfew} 


is compact. In fact, Ny is closed. When (,f)eNy we have f=S,T,f ~K,,f 
where K,,feM,, and 
ITF l2SCilfly=C 


since T, is strongly continuous in t. The compactness of N, follows now 


from the fact that 
{(, g)el x W; Ilgll2SC} 


is compact and that (t,g)h> S,g is continuous there. We can now conclude 
that the projection in J 


I¥ ={t; (t,f)e Nw for some f} 


is compact. The complement is therefore open and contains fo by the choice of 
W. For all ¢ in a neighborhood of f& we have therefore proved that 


codim T; By 2 dim W=codim T,, B,, 
which completes the proof. 


The hypotheses of Theorem 19.1.10 are very natural in the study of 
elliptic pseudo-differential operators T,; the operators S, are their para- 
metrices. - We shail now leave the stability properties of the index and 
derive a trace formula, for the index of Fredholm operators in Hilbert 
spaces. It will be used to prove explicit index formulas in Section 19.3. First 
we recall the basic facts on Hilbert-Schmidt and trace class operators. 


Definition 19.1.11. If H, and H, are Hilbert spaces then the space 
&,(H,,H,) of Hilbert-Schmidt operators from H, to H, consists of all 
Te L(H,,H,) such that 


IT2=d I Te;ll 


is finite, if {e,} is a complete orthonormal system in H,. 
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The notation already presupposes that the sum is independent of the 
choice of orthonormal basis, which is indeed the case since for any complete 
orthonormal system {f;} in H, we have 


Ve? =) Te? = dle. TAA? =D MT GI. 


This shows also that T*e¥,(H,,H,) and that T* has the same Hilbert- 
Schmidt norm as T. If S,eY(H,,H,) and S,e(H,,H‘) for some other 
Hilbert spaces H, and H), we also have S, TS,e¥,(H,,H,) and 


IS TSyli2S US. TH2 Sy I, 
the norms of S, and S, being operator norms. In fact, 
IS_ TSy lp SUS20 TSy2=TS2i IST T*,SUS21 UST AT * ho. 


If Te Y(H,,H,) has finite rank, that is, the kernel N has finite codimen- 
sion, then we can factor T as the canonical operator H,- H,/N followed by 
an operator H,/N—-H,. Since H,/N is finite dimensional the second opera- 
tor is a Hilbert-Schmidt operator and we conclude that Te#,(H,,H,). 
Operators of finite rank form a dense subset of ¥,(H,,H,) since we can 
approximate any Te ¥,(H,,H,) arbitrarily closely by changing the defini- 
tion of Te; to be 0 except for a large but finite subset of indices. In 
particular we conclude that all Hilbert-Schmidt operators are compact. 

If TjeL,(H;, Ho), j=1,2, it follows that T=T;* T,e,(H,,H,), but the 
product has additional properties. In fact, if {e,} and {f,} are orthonormal . 
systems in H, and H, labelled by the same index set, then 


DV Te,, f= LUT, ej, To ASAT Mle Tolle. 
On the other hand, let Te Y(H,,H,) and assume that 
(19.1.3) Li(Tej.f)< 00 


for all orthonormal systems {e,;} and {fj} in H, and H, labelled by the same 
index set. Let A=(T* T)? which is a bounded self-adjoint operator in H,. If 
the kernel is N, the range is dense in the orthogonal complement N’ of N. 


Since ITS |? =(T* TEA)SHNAS I, 


the map Afi Tf is isometric, so its closure is an isometric operator 
U: N’> H,, and T=UA. If {e;} is a complete orthonormal system in N’ it 
follows that f;= Ue, is an orthonormal system in H,, and we have 


» (Te, fl = > (A Cj» e;). 
Thus B= A? is a Hilbert-Schmidt operator and 
|Bil3 =S(A ej» e;)s 


for B vanishes in N. Recalling that |UB],<||B||, we take T,=B and T+ 
= UB, which gives T=T;} T, and 


ITyll2 NT SIBIZ=> (Te, AD 
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The opposite inequality is always true so we must have equality. Summing 
up, we introduce the following definition: 


Definition 19.1.12. If H, and H, are Hilbert spaces then the space 
Y,(H,,H,) of trace class operators from H, to Hy, is the set of all 
Te L£(H,,H.) such that (19.1.3) is valid for all orthonormal systems {e;}, 
{fj} in H, and Hy, respectively, or equivalently T=TT, where 
TjeL,(H,,H,) for some other Hilbert space Hy. The norm in #,(H,,H,) is 
defined by 


(19.1.4) [Tl], =sup >) |(Te;.f)=inf ll Tl Tall, 


the supremum (infimum) being taken over all such {e,}, { f;} (resp. T,, T,). 


The triangle inequality and the completeness follow from the first part of 
(19.1.4). From the second part we conclude that if TeY,(H,,H,) and 
S,¢eL(H,,H,), 8,¢-L(H,, WH) then S, TS,e£,(H,,H,) and 


IS. TSyiySUSoi NTs Sal. 


Hence the operators of finite rank are of trace class, and they form a dense 
subset since this is true for Hilbert-Schmidt operators. 
If Te, (H, H) we define the trace of T as 


Tr(T)=Tr(T|H)=) (Te;,e,) 
where {e,} is a complete orthonormal system in H. This is independent of 
the choice of {e,}, for if we write T=T,* T, with T,e@,(H, Ho) then 


2 
It, T, +t. T3= Y YGF Te;,e)t, ty. 
veu=l fj 
The coefficient of t,t, is independent of the choice of e,; and it is equal to 
Tr(T). Thus the trace is uniquely defined and it is a linear form of norm 1 
on &,(H, A). 


Proposition 19.1.13. If Te&,(H,H) and H’ is a closed subspace of H such 
that TH'<H’ then the restriction T’ of T to H’ and the operator T” in H" 
= H/H' induced by T are both of trace class, and 

(19.1.5) Tr(T|H)=Tr(T’|H’)+Tr(T'1 A”). 

If T,eL,(H,,H,) and S is a continuous bijection H,— H,, then 

(19.1.6) T,=ST,S~'e¥,(H,,H,),  Tr(T,|H,)=Tr(T,|H,). 


Proof. That T’ is of trace class follows from the definition of trace class 
operators in terms of (19.1.3). We can identify H” with HOH’. If P is the 
orthogonal projection H- H” it is clear that T” is the restriction of PT to 
HH’, hence of trace class. If we take an orthonormal basis {e'} in H’ and one 
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{e/} in H” then they form together an orthonormal basis for H and (19.1.5) 
follows at once. If Te Y,(H, H) is of finite rank and N is the kernel of T, it 
follows from (19.1.5) that Tr(T) is the trace of the operator induced by T in 
the finite dimensional vector space H/N. Hence it is independent of the 
norm in the Hilbert space, which proves (19.1.6) when T, is of finite rank. 
Since operators of finite rank are dense in ¥,(H,,H,) and the map 
L£(H,,H,)3T,b>ST, S~'eL,(H2, H2) is continuous, we obtain (19.1.6). 


We can now prove a well-known trace formula for the index: 


Proposition 19.1.14. Let Te f(H,,H,), Se L(H,,H,), and set 
R,=1,-ST, R,=I1,-TS, 


where I, is the identity operator in H,. Assume that R* and R¥ are of trace 
class for some N>O. Then it follows that T is a Fredholm operator and that 


(19.1.7) ind T=Tr(R4)—Tr(R3). 
Proof. Assume first that N=1. By Corollary 19.1.9 the compactness of R,; 
implies that T is a Fredholm operator. Note that 

TR,=R,T 


and write H,=N,@®H; where N, is the kernel of T and H’, is the range of T. 
Then T induces a bijection H,—>H’,. If PB is the orthogonal projection 
H,-H;, then TRR,=TR,=R,T. Hence R, maps Hy, to itself, and it 
follows from (19.1.6) that 


Tr(R,|H,)=Tr(F R,|H}). 
Now R, is the identity on N, so we obtain from (19.1.5) 
Tr(R,)=dim N, + Tr(A R,|A}). 


Furthermore, [,-—R,=TS maps H, into H’,, so R, induces the identity on 
H,/H’,. Using (19.1.5) again we obtain 


Tr(R,)=dim N,+Tr(R,| 4) 
which proves that 


Tr(R,)—Tr(R,)=dim N, —dim N, =ind T. 
Thus Proposition 19.1.14 is proved when N =1. If N>1 we replace S by 
S’=S(1,+R,+...+R¥~'). 
It is then obvious that TS’=I,—R}, and we have 
S’T=S(I,+(,—TS)+ ...+(1,-TS)*—') T. 
Since S(TS)* T=(S T)**! and 
x(1+(1—x)4+...+(1—x)¥~4)=1-(1—x)’, 
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it follows that 
S'T=1,-(U,-ST)*=1, —R}. 


The proof is therefore reduced to the case N=1 and so complete. 


The first part. of the proof is a special case of the following result on 
invariance of Euler numbers which will be the basis for our work in 
Section 19.4. 


Theorem 19.1.15. Let 


Tn-1 


(19.1.8) H_,={0} +H,—> H,—+H, >... 2S Hy > (0}=Hy, 1 


be a complex of Hilbert spaces with all T; continuous with closed range. Thus 
we assume T;T;_,=0, that is, 


T,_,H,_,c{feH,;T,f=O}=N, j=i,...,N. 
Let R»e£,(H,;,H;) commute with the complex, that is, 
(19.1.9) R,T,_,=T;_ i Rj_ po 2 fliaeg NG 


Then R(T;_,H;_,)<T;_, Hj_, and R;N;<N; so R; induces an operator R, 
of trace class in H,=Nj/T;_, H;_,. We have 


N N 
(19.1.10) ¥(- 1)! Te(R,|H)=¥(—)! Tr(R A). 
0 0 


Proof. By (19.1.9) we have 
R(Tj_;4j;_)=T),_,R;-, Bj. ¢ Tj; Ay-1, 


and R,N,cNj since T;R;N;=R,,,7,N;={0}. From Proposition 19.1.13 it 
follows that R; as well as ihe aperatar Rj, induced by R; in H,/N, are of 
trace class and ‘that 


Tr(R,|Hj)=Tr(R,| Nj) + Tr(Rj A /N)) 
=Tr(R,|T;_, H;_,)+Tr(R,|H)+Tr(RjlA,/N). 


j-1 


Now 1; defines an isomorphism H ,/N,> 7;H,, and by (19.1.9) it transforms 
Rj to R;,, restricted to T;H;. Thus 


Tr(R,|H)=Tr(R,{T;_, Aj.) +Tr(R lH) +Tr(R;, | 7H). 


The first and the third terms drop out when we form the alternating sum in 
(19.1.10) which proves this formula. 


The complex (19.1. 8) is called a Fredholm complex if all T; have closed 
range and the spaces H,= Ker T,/T,;_, H;_, are finite dimensional. A Fred- 
holm operator defines a "Fredholm ‘eoniplek with N=1. Conversely, a Fred- 
holm complex can be split to a Fredholm operator: 
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Proposition 19.1.16. The complex (19.1.8) is a Fredholm complex if and only if 
the operator A defined by 


H™" =@H3;3(fo.f25--)>(Tofo + T#* fos Th fo + T# fs, ...\€PDH2j+1 =H“ 
is a Fredholm operator. The index of A is the Euler characteristic 


¥(-1) dim H,. 


Proof. Since T,_, H,_,<Ker T; the range of T;_, is orthogonal to the range 
of T;*. Hence 


(19.1.11) |ACfosfas +)? = LUT fash? + Tas Aayll- 
Assume first that we have a Fredholm complex. Then 


fall? S CIT Aryl? 


when f,; is orthogonal to Ker T,,, for T,, has closed range. Since T,,_, has 
closed range it follows that Tj,_, has closed range, hence that 


Ifo? SC UTR foil? 


when f,; is orthogonal to Ker T3_,. Now H,, is the orthogonal sum of the 
orthogonal complement of Ker 7,;, that of Ker T#_,, and Aa, which we 
identify with Ker T,, Ker T_,. Hence 


I foil? SCOTa Fil? +1 TH fail’) 


if f,; is orthogonal to Hs for we can then write f,;=g,;+h,,; where g,, is 
orthogonal to KerT,,, hence in KerT3_,, and h,,; is orthogonal to 
Ker T3_,, hence in Ker T,;. The inequality is therefore reduced to the two 
preceding ones applied to 82; and to h,;. Using (19.111) we can now 
conclude that A has closed range and kernel @A2; j- In the same way it 
follows that the kernel of A* is @H2;.1:, which proves that A is a Fred- 
holm operator with index equal to the Euler characteristic. 

On the other hand, if A is assumed to be a Fredholm operator it follows 
from (19.1.11) that W,=Ker T,,;0 Ker T;4_, is finite dimensional and that 


Ifill? S CUT Fal? + 1TH All”) 


Lira f,; is orthogonal to W;. When f,; is orthogonal to Ker T,; we have 

Ti-1f2;=0 so || fai? SC || Ty fol? which proves that T,, has closed range. 
When f,; is orthogonal to Ker T;}_, we obtain that T;,_, has closed range, 
hence T,;_, has closed range. The proof is complete. 


We shall apply the preceding method to a complex defined as a product 
of the complexes associated with two Fredholm operators A,;: H}—> Hi, j 
=1,2. First recall that the tensor product H, @H, of two Hilbert spaces H, 
and H, is defined as the set of continuous antilinear forms on H,xH, 
corresponding to antilinear maps H,-H, of Hilbert Schmidt class in the 
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obvious sense. Thus H, @H,=,(H,,H,) where H, is H, with multiplica- 
tion by scalars changed by complex conjugation. (Equivalently, if we realize 
H, as D(X ;,H)) where pj; is a positive measure on X, then H,@H, is 
naturally isomorphic to L?(X, x X,).) Imitating the definition of the de 
Rham complex of differential forms in IR? we introduce the complex with 
the spaces 


Go=Hi@Hj, G,=Hi@HoOHV@Hi, G,=Hi @H;j, 
and the maps 
Ty (fo @fo)=lAifo @fe.fo @ Azfo), 

T, (fe @fe fo @S)=Aifo @f? ~ft @Azfo- 
Here f*eH}. It is immediately clear that we obtain a complex. Splitting it as 
in Proposition 19.1.16 we obtain an operator A:G,@G,—-G, defined by 
A= (7 @ Is hea) 

13@A, ATO 
in block matrix notation. If F=(Fo, i)e¢H} @ H2@H} @ H? then 
AF =((A; @ 8) Fo—(1i @ AB) Fi, (16 @ Aa) Fo + (At @ 17) Fi) 
and thus 
(19.1.13) | AF||* = ||(A; @ 19) Foll? + (15 ® Az) Fol’ 
+, @ 49) Fl? + ATOM AI? 

It follows immediately that 


(19.1.12) 


(19.1.14) Ker A= Ker A, @ Ker A, @ Ker A¥ © Ker A?. 
The estimate 
|F\?SC\AF\}* 
is valid for all F orthogonal to this space if 
IFIP SCA SI? 
when feHi, is orthogonal to Ker A,, and 
Ig? <C\A* gil? 


when géHij is orthogonal to Ker A¥. These estimates are equivalent, and we 
see that the range of A is closed because A, have closed range. Similarly we 
find that A* has closed range and that 


(19.1.15) Ker A* = Ker Af @ Ker A, @ Ker A, © Ker A?. 
Thus A is a Fredholm operator and 
ind A=dim Ker A, dim Ker A, + dim Coker A, dim Coker A, 
—dim Coker A, dim Ker A,—dim Ker A, dim Coker A,, 


which completes the proof of the following theorem: 
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Theorem 19.1.17. Let A;: Hi H{, j=1,2, be Fredholm operators. Then the 
operator A from Hj} @H2@H\@H+ to Hi\@HZ@Hj{@H? defined by 
(19.1.12) is a Fredholm operator, and 


(19.1.16) ind A=ind A, ind A,. 


We shall end this section by proving a more general version of Theorem 
19.1.15 which is occasionally useful although we shall not need it in this book. 
The main point is to extend the last assertion in Proposition 19.1.13. 


Proposition 19.1.18. Let H, be a Hilbert space and let Tje%,(H,,H)), j=1,2. 
If there exists a closed linear operator S with domain dense in H, and range 
dense in H, such that S is injective and T,S<ST,, then it follows that Tr(T;) 
=Tr(T)). 


Proof. By a classical theorem of von Neumann A=S*S is self-adjoint and S 
is the closure of the restriction to the domain of A. Thus B= A? is a self- 
adjoint operator in H, with the same domain as S, and we have there 
(BS il, =||Sfl|,. Let B® be the spectral projection for B corresponding to the 
interval (¢, 1/e). Since Bf +0 when f+0 it follows that P’f—>f as «0 for 
every feH,. Define P'=SP'S~' on SH, which by hypothesis is a dense 
subset of H,. If g=Sf then 


IF gll=ISES| = (BFS SIBSI =ISF1l=lI1. 
Thus the closure Q of P} is defined in H, and has norm <1; Q?=Q so Q is 
a projection. It is an orthogonal projection, for ||h||<||g+h\| for all h with 
Qh=h and g with Qg=0, so these spaces are mutually orthogonal. We shall 
denote the closure Q also by Py. If g=S/f then Pi g=SP'f, 
BP'f=P°Bf>Bf, «0. 

Hence SR'f is also convergent, necessarily to Sf=g, which proves that 
PY g—g for every fixed geH,. 

Since the range of P° is contained in the domain of S, we have by 


hypothesis 
T, SP! =ST, Pe, 


hence in the domain of S 
TP S=ST, PP. 


If we multiply to the left by P? we obtain 
Py T, Py S=SRP T, FP. 


Since S is a continuous bijection of P’H, on PfH,, with norm <e~’ for 
the operator and its inverse, it follows from Proposition 19.1.13 that 


Tr (P% T, P§)=Tr (f° T, P). 


The proof is now completed by the following lemma: 
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Lemma 19.1.19. Let Te2,(H,H) and let S,, RsjeY(H,H) be sequences con- 
verging strongly to the identity. Then it follows that Tr(S, TR¥)>Tr(T). 


Proof. Since 
IS; TRFI,SC WTI, 

if C is an upper bound for |S, and ||Tj||, it is sufficient to verify the 
statement when T is of finite rank. Then N=Ker T is of finite codimension 
and we can write T as a product T=T7T’P where P: H->H/N and T’: 
H/N—H are bounded. Then S,T'>T’ in £,(H/N,H) since H/N is finite 
dimensional, so 

|S;T—T\,S|Pi |S;T'—T'], 0. 
Similarly, 

| TRF -T||,=||.R; T*—T*|, 79, 
which implies that 

|S; TR¥-Tl, SS, | TRF -—TI],+S8,;T-T], 79. 


Hence Tr(S, TR¥)—>Tr(T), which proves the lemma. 
We can now extend Theorem 19.1.15 as follows: 


Theorem 19.1.20. Let (19.1.8) be a complex of Hilbert spaces and densely 
defined closed linear operators T,. Thus we assume that the range W, of T,_, 
is contained in Ker T,. (We set W)= {0} and Ker Ty= Hy.) Let Rje #%,(H,;,H)) 
commute with the complex in the sense that 


Rj, 7,CT7,R; j=0,....N—-1. 
Then R,;W,-W,, R,Ker T;< Ker T;, and_R, induces an operator R, of trace 


class in "the Hilbert space Aj= =Ker T,/W;. With this notation the invariance 
formula (19.1.10) remains true. - 


The proof is identical to that of Theorem 19.1.15 with Proposition 19.1.13 
improved by Proposition 19.1.18. We leave for the reader to verify that this 
is true. 
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Let X be a compact C® manifold without boundary, and let E, F be two 
C®” complex vector bundles over X with the same fiber dimension. Their 
antiduals will be denoted by E* and F*. We recall from Section 18.1 that a 
pseudo-differential operator Pe ¥"(X ; E@ Qt, F @ Q*), where Q is the densi- 
ty line bundle on X, has a principal symbol 


peS"(T*(X), Hom(z* E, 2* F)) 
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uniquely determined modulo S"~'. Here the fiber of Hom(zx* E,x* F) at 
yeT*(X) consists of the linear maps E,,—>F,, if a is the projection 
T*(X)—X. Also recall that P is elliptic if there exists some qeS~™(T*(X), 
Hom(z* F, 2* E)) such that 


(19.2.1) qgp—IeS~'(T*(X), Hom(x* E, x* E£)), 
pq—leS~'(T*(X), Hom(z* F,x* F)); 


the conditions are equivalent when E and F have the same fiber dimensions. 
By I we have denoted the identity maps in z* E and in x* F respectively. 


Theorem 19.2.1. If Pe ¥"(X ; E@ 2, F @ Q*) is elliptic then P defines a Fred- 
holm operator from H,,(X;E@Q*) to Hy,_»(X;F@Q*) with kernel 
< C®(X; E@ Q?), thus independent of s; the range is the orthogonal space of 
the kernel <C®(X; F*@Q*) of the adjoint P*e¥"(X; F* @Q?, E*@Q?). 
Thus the index is independent of s and equal to the index of P as operator 
C®(X; E@Q?)> C~(X; F@Q*) or D(X; E@Q*)+ D(X; F @M), and it 
depends only on the class of P modulo ¥"-!. If E=F* and P—P*ep"-! 
then ind P=0. 


Proof. Choose Qe ¥~"(X; F @Q?, E®Q*) so that (19.2.1) is valid for the 
principal symbols of P and of Q. Then 


P: H,)(X 3 E@ 2?) > Hy. m(X 3 F @ 0%), 
Q: Hy m(X3 F @ Q*) > H,(X; E@ 2%), 
K,=QP-I: H.(X;E@Q*)> H,, ;(X; E@M?), 
K,=PQ-I: H (X; F @2*)> Hong 1)(X3 F@Q*) 


are continuous for every s. The map H,,,,,7H,, is compact for every s. 
(See Theorem10.1.10 or else use the characterization of H, in Theo- 
rem 18,1.29 and note that symbols of order —1 can be approximated in S° 
by symbols of order — oo. See also Theorem 18.6.6 for a general version of 
the statement.) Hence it follows from Corollary 19.1.9 that P: H,, > His_m) 
and Q: H.,_»)—~H, are Fredholm operators and that ind P+indQ=0. 
Since (19.2.1) is only a condition on the principal symbol, the choice of Q 
only depends on the equivalence class of P modulo ¥"~', so indP is 
independent of P within such a class. Now ueKer P implies 


u= —K,u=...=(—K,)"ueH,,m 


(s—m) 


for every positive integer N. Thus ueC®™. The adjoint P* of P is an elliptic 
pseudodifferential operator mapping Hin_(X, F*@Q*) to Hi_(X, E*@%). 
If v is in the kernel then 


= ea =, *\N 
v=—Kjv=...=(—K3) veH sn) 


for every N so veC™. Since the range of P as operator from H,, to H,,_») is 
closed, it is the orthogonal space of this kernel, which completes the proof, 
for the last statement follows from the fact that ind P= —ind P*. 
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From Corollary 19.1.7 it follows that for the product FP P, of two elliptic 
operators we have 


(19.2.2) ind(B P,)=ind RB + ind P,. 


By Corollary 19.1.6 we have indR=indR if Bew"(X;E@Q?, F@Q*) 
are elliptic and the principal symbol of BR ~P, is sufficiently small in S”. 
A stronger stability property follows from Theorem 19.1.10: 


Theorem 19.2.2. Let I be the interval [0,1] on IR, and let Iatra(t) 
EeC*(T*(X); Hom(n* E,x* F)) and [atreb(t)e C*(T*(X); Hom(zx* F, x* E)) 
be continuous maps such that a(t) is uniformly bounded in S™, b(t) is uni- 
formly bounded in S~™, and a(t)b(t)—I is uniformly bounded in S~1(T*(X); 
Hom(n* F,x* F)) while b(t)a(t)~I is uniformly bounded in S~'(T*(X); 
Hom(n* E,x* E)). If Ag, A,E¥"(X: E@Q?, F @Q*) have principal symbols 
a(0) and a(1) respectively, it follows then that ind Ag=ind A,. 


Proof. Let us first recall how one obtains an operator A(t) with principal 
symbol a(t). (See the discussion after Definition 18.1.20.) Choose a covering of 
X by coordinate patches X,, with local coordinates k;: X; —X jo R", such 
that there are local trivializations 


bj: Ely,2X;x@", by: Fly,7X;xC". 


We can choose the covering so fine that X,U X, is also contained in such a 
coordinate patch whenever X,7.X,+9. Set 


ay(t, K(x), £) = Yi (xdalt,x,'K)(X)E)Oj(x) |, x € X;, gE eR", 
which is then an NxN matrix with entries in S"(X,xIR"), where n of 
course is the dimension of X and N is the fiber dimension of E and of F. 
Choose a partition of unity 1=}\yj on X with y,eCP(X,). Then we can 
define A(t) for ueC@(X, E) by 
A(thu=Vx,W7" K* a,(t,x, D)(Kj7')* (x; 6;u). 


Similarly we define B(t) with principal symbol b(t). To compute A(t) B(t) 
and B(t) A(t) we can work in local coordinates using Theorems 18.1.8 and 
18.1.17. It follows that 


K,®)=BOAH—I, K,(t)=AWBO)-I 


are of order —1 and uniformly bounded as operators H,,. > H,,, 1) for any s. 
Hence Theorem 19.1.10 gives at once that 


ind A, =ind A(0)=ind A(1)=ind A,, 


which proves the theorem. 


To determine the index of arbitrary elliptic operators it suffices to study 
operators with polyhomogeneous symbol of order 0: 
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Theorem 19.2.3. Let Pe ¥"(X ; E@ Q3, F @*) be elliptic, with principal sym- 
bol p, and let h be a homogeneous function of degree 1 on T*(X) which is 
positive and C® outside the zero section. Then the operators with principal 
symbol p(x,Ré/h) are elliptic and have the same index as P when R is 
sufficiently large. 


Proof. First assume that m=0. Choose geS° so that gp—I and pq—I have 
compact support, and set for OSeS1 


p(x, )=p(x,Ex(eh)), — g(x, )=a(x, € x(eh)), 


where y is a decreasing C™ function on R with y(t)=1 when t<1 and y(t) 
=1/t when t>2. By hypothesis we can find C so that pq=I and qp=I 
when h= C. This implies that 


Pe4e—1=pq-I, 4,p,-l=qp—I 


provided that hy(eh)2 C when eh2 1, that is, eC <min,,,ty(t). We have py 
=p so the statement follows from Theorem 19.2.2 if we show that p, and q, 
are uniformly bounded in S° for O<eX1. To do so it is sufficient to 
examine the set where 1<eh<2, for p,=p and q,=q when eh<l, and p,, q, 
are homogeneous of degree 0 when eh22. With the notation e€=n the 
statement for p, is that 


p(x, x (A(x, n))/e) 


is uniformly bounded in C® in the compact subset of T*(X)\0 where 
1<h(x,4)S2. This follows at once from the fact that p(x, 0/e) is uniformly 
bounded in C® when $<h(x,6)<2, because p is a symbol of degree 0. The 
proof is now complete when m=0. 

To handle the general case we choose a C®™ positive hermitian metric in 
E* and write it in the form 


(a(x)v,v), veE*. 


Thus a(x): E*— E, is bijective and its own adjoint. Choose now a pseudo- 
differential operator AeW~"(X; E* @Q?,E@Q*) with principal symbol 
ah~™. By Theorem19.2.1 the index of the elliptic operator A is 0 so 
PAe¥(X; E* @ Q, F @ Q*) has the same index as P and principal symbol 
pah-™, Thus it follows from the first part of the proof that P has the same 
index as Pa if a denotes multiplication by a and P, has principal symbol 


p(x, R o/h) h(x, R C/h)~" = p(x, R C/h) R-™ 
for some sufficiently large R. This completes the proof, for a is invertible. 


If peC°(T*(X)\0, Hom(z* E,x* F)) is homogeneous of degree 0 and 
elliptic, that is, p(x, ¢) is invertible for every (x, )¢T*(X)~0, then we define 
s-indp as the index of the elliptic pseudo-differential operators with prin- 
cipal symbol p. From Theorem 19.2.2 or just from Corollary 19.1.6 it follows 
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that s-ind p; = s-ind p if 
(19.2.3) Sup p(x, 2)" py, €)—II <1, 


the norm being taken with respect to some hermitian metric in E. In fact, 
tp,+(1—t)p is then the principal symbol of an elliptic operator with index 
independent of t when O0<t<i. This observation allows us to define 
s-indp even if p is just continuous; we define s-indp = s-indp; if p, is 
any C® homogeneous elliptic symbol of order 0 satisfying (19.2.3). The 
definition is independent of the choice of p, since (19.2.3) implies that 


sup IIp2(«, €)~* py(x, J—-T <1 


for some py € C®@™ arbitrarily close to p, hence s-indp; = s-indp2. For our 
extended definition we still have s-indp = s-indp; if p and p; are just 
continuous, elliptic and homogeneous symbols of degree 0 satisfying (19.2.3). 

Theorem 19.2.3 suggests a further extension. Let peC(T*(X), 
Hom(z* E,x* F)) and assume that the set of (x,¢) such that p(x,¢) is not 
invertible is compact. If we choose h as in Theorem 19.2.3 then 


Pr(x, 6)= p(x, R o/h) 
is a homogeneous elliptic symbol when R is large, and we define 
s-ind p = s-ind pr 


then; the definition is independent of the choice of R. It is also independent 
of the choice of h, for if h, is another positive continuous function which is 
homogeneous of degree 1 then 


P(x, RE/(th+(1 —t) hy) 


is for large R invertible when € # 0 and 0 St < 1. Thus s-indp does not change 
if h is replaced by hy. 


Theorem 19.2.4. There is a unique function s-ind defined on all p € C(T*(X), 
Hom(x* E, 1* F)) with invertible values outside a compact set, such that 

(i) s-indp = indP if P is an elliptic pseudo-differential operator of order 0 
with homogeneous principal symbol p of order 0; 

(ii) if p, for O St & 1 is a continuous function of t with values in C(T*(X), 
Hom(x* E, x* F)) and p,(x, &) is invertible for all such t except when (x, ) is in 
a compact set, then s-ind p, is independent of t. 


Proof. We have already proved the existence, and the uniqueness follows 
from the homotopy used in the proof of Theorem 19.2.3. Note that Theo- 
rem 19.2.3 states that ind P = s-indp for any P € U"(X;E @ 2, F @ 24) with 
principal symbol p. 


Before proceeding we shall discuss the elementary example where X 
=R/27Z is the circle and the bundles are just X x C. It is then enough to 
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study the operator with principal symbol 
a(x,¢)=a,(x), €>0; a(x, ¢)=a_(x), ¢<0, 


where a, and a_ have values in C~ {0}. We can multiply by the (multipli- 
cation) operator a=! which has index 0 to change the symbol to a, (x)/a_(x) 


when €>0 and to 1 when é <0. If m is the winding number of a, (x)/a_ (x) then 
a, (x)/a_(x) =e 40 


where @ is periodic. Replacing @ by t@ for OSt<1 gives a homotopy 
showing that it is enough to consider the case ¢=0. 
The operator P defined by 


PQ uy, ek) — Y U, eikx 
k20 


which takes the analytic part of the Fourier series expansion of u is a 
pseudo-differential operator with symbol 1 for €>0 and 0 for €<0. In fact, 
if u is a trigonometrical polynomial then 


Pu(x)= lim (27i)-' $6) uy, z*(z—re)dz 
r>1—0 
= lim fu(y/-re@-”) dyn. 
r>1-—90 ‘ 


The limit of 1/(1—e7*e'*) as e> +0 is i(x+i0) plus a term which is in C™ 
in (—72,72). Since i/(22(x+i0)) is the inverse Fourier transform of the Heav- 
iside function this proves the statement. What remains is to ‘determine the 
index of the operator 

ure” Put+tu—Pu. 


Let u, be the analytic function in the unit disc with boundary values Pu 
and let u_ be the analytic function outside the unit disc with boundary 
values u—Pu; it vanishes at oo. If ueC™(X) is in the kernel we have 
2"u,(z)+u_(z)=0 when |z|=1, so the function which is z"u,(z) when 
|z|<1 and —u_(z) when |{z|21 is an analytic function vanishing at 00, hence 
equal to 0, if m=O. If m<O then q(z)=u,(z) when |z|S1 and g(z)= 
~—z~™y_(z) when |z|21 defines an analytic function which is O({z|'"'!~+) at 
infinity. Thus q is a polynomial of degree |m|—1 and conversely every such 
polynomial defines an element in the kernel. Thus the dimension of the 
kernel is 0 when m2=0 and —m when m<0O. 

On the other hand, if feC*(X) and we write f=/,-+/f_ as for u, then 
the equation 


2" u,(z)+u_(Z=f,(2+f_() 


is satisfied by u_=f_ and u,=27~"f,(z) if ms0. When m>0 it is equiva- 
lent to u_=f_ and z"u,(z)=/f,(z) so it can be fulfilled if and only if f,(z) 
=(Q(|z|"). Thus the dimension of the cokernel is 0 and m in the two cases. In 
both cases we find that the index is —m. Summing up, we obtain if a(x, €) is 
a continuous function from X xR to C without zero for large |¢| that 
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s-ind a is the degree of mapping for the mapping from the boundary of 
{(x,€); xeX, |E|<R} to a(x, 2)/la(x, djeS' if R is large enough. Here we 
have oriented the cotangent bundle of X by dx ndf>0. 


The extension of the definition of s-ind to continuous symbols was 
made above for two reasons. Firstly it underlines the topological character 
of this object, which was also seen in the explicit example just discussed. 
Secondly, the product operation which we shall discuss below leads to 
operators which are not quite pseudo-differential but are covered by the 
following theorem. 


Theorem 19.2.5. Let Pe'B%,(X; E@2*, F @Q*) and assume that P—>P in 
L(A (X;E@Q*), Hy, _m(F @Q*)) for some se. Then the homogeneous 
principal symbol p; of P, converges uniformly on compact subsets of T*(X)~0 
to peC(T*(X)\0; Hom(x* E, x* F)). If p is everywhere an isomorphism then 
P is a Fredholm operator from H..(X;E @ 3) to Fy m(F @ 2+) with index 
equal to s-indp (if p is modified in a compact set to a continuous function). 
If the hypothesis is valid for all seIR then Ker P is a subspace of C*(X; E@*) 
and the range is the orthogonal space in Hi_m(X;F@Q*) of a subspace 
of C®(X; F*¥ @Q?). 


Proof. If a€Stng(R” x R") has principal symbol ay, and u, ve CF (IR"), then 
t—™ (a(x, Dy(u e'+#8)), y ef #8)) = (a(x, D +t u,v) > (aol, 6) u,v). 
We have 
£8 cel = (2m) fan)? (t-7 +1 + n/t’ dn)* = [lu EF, 
where the norm is the I? norm. A similar estimate is true for v, so it follows 
ne (dof. 6) u, v)| SM |lul oll, 1el=4, 


if M is the norm of a(x,D) as operator from H, to H,,_»). Thus 
sup |a9|M. If we apply this to @(R—R)w with $,weCo(X) having sup- 
\g}=1 

port in a coordinate patch, it follows that p;,—p,-—0 uniformly on compact 


subsets of T*(X)~0 as j and k- «0, so p=lim p, exists and is a continuous 
function, homogeneous of degree m. 

Assume now that p is invertible. For a moment we also assume that 
s=m=0., Then we have a uniform bound for Bp? on T*(X)~0 when | is 
large. By the second part of Theorem 18.1.15 it follows that we can choose a 
pseudo-differential operator Q, with principal symbol p;*.such that the 
norm of Q; in #(H,,H,) is uniformly bounded if H, =1?(X;E@*) and 
H,=D(X; F @Q*). Now 


Q,P=1+0,(P—P)+(Q;B-D, 
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I+Q,(P—P) is invertible in #(H,,H,) for large j, and Q,P—J is compact, 
so Q;P is a Fredholm operator. Thus Ker P is finite dimensional. Since PQ, 
is a Fredholm operator for similar reasons it follows that P has a closed 
range of finite codimension. Hence P is a Fredholm operator, and for large j 


ind P = ind P; = s-ind p; = s-ind p. 


For arbitrary s and m we choose as in the proof of Theorem 19.2.3 
elliptic pseudo-differential operators 


AeW-*(X; E*@ 04, E@Q+) and BeP-"(X; F @M?, F*@Q*) 


with index 0. Thus B(P.—P)A->0 in A(L?(X; E* @ 2%), P(X; F* @ 2%) so it 
follows from the case of the theorem already proved that BPA is a Fred- 
holm operator. Let A’ and B’ be parametrices of A and of B. Then 


P=B'(BPA)A’ +(1—B’ B) PAA' + P(I—AA’) 


is a compact perturbation of the Fredholm operator B’(BPA) A’, hence also 
a Fredholm operator. For large j the index is equal to that of PB, thus equal 
to s-ind p; = s-ind p. 

If the hypothesis is fulfilled for all s then Ker PH, (X;E@*) de- 
creases with s. If P*: Hm —9(X 3 F* @ Q*) > H_,(X; E* @ 0) is the adjoint 
then dim Coker P is equal to the dimension of Ker P*c Hy, _(X;F * @ Q?) 
which increases with s. Since the index is independent of s it follows that 
Ker P and Ker P* are independent of s, which completes the proof. 


In our application the approximating ‘sequence of pseudodifferential 
operators will be obtained from the following lemma. 


Lemma 19.2.6. Let a(x, y, &)€Sthg(IR"*” x IR") where m>0, and let ag be the 
principal symbol. Choose ye C®(IR"*") so that x(&,n)=1 when |y| <max(1, |é|), 
and x(é,n)=0 when |n| =max(2, 2|é|), while x(€,7) is homogeneous of degree 0 
when |y|>2. Then 


a,(x, y,E,n)=a(x, y, 2) x(E,eneSt,,(R"*” x IR"*"), 


the principal symbol converges uniformly to a, when |é|* +|q|?=1 as e>0, and 
the norm of a,(x, y,D,,D,)—a(x, y, D,) as operator from H., to Hy,_ my tends to 
0 as e-0 for every seR. 


Proof. The principal symbol of a, is a(x, y,) x%o(¢,¢%) if xp is the principal 
symbol of x. Here 7)(&,e4)>7,(¢,0)=1 if e>0 and €+0; since ag(x, y,0)=0 
this proves the convergence of the principal symbols. We have 


(a,(x, y, D,, D,)— a(x, y, D,)) u=a(x, y, D,)v,, 
v,=X(D,,€D,)u—u. 
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For fixed y we can estimate the 2? norm of a(x, y,D,)v with respect to x by 
the H,,,, norm of v with respect to x. Thus 


(a,x, ys D,,, D,)—a(x, y, D,)) ull) 
SCIA+lE?lx(Een)— 17? Wale m)? dé dn 
S |luligy sup CL +16)?" 1x6 en) — 17 (1 +18? + nl?)-™. 


When x(é,e7)+1 we have 1+|é|/? <2|en|?, hence 

(1+ 1E?/1 +161? + Il?) S207 Inl?/ +11? + In?) S20. 
This proves that 
(19.2.4) i(a,(, y, D,, D,)—a(x, y,D,)) ull) SC 2" [lull am: 
Next we shall prove that 
(19.2.4y iI(a,(x, y, D,, D,)— a(x, y, D,)) ull Coe” lullans sy 


if s is a positive integer. To do so we observe that y(D,,¢D,) commutes with 
differentiations while the commutator of a(x, y, D,,) and a differentiation is of 
the same form. Hence D*%(a,(x, y, D,,D,)—a(x, y,D,))u is a sum of terms of 
the form 


(b,(x, y, D,, D,)—b(x, y, D,)) D® u 


with |B|<|o| and b of the same form as a. This gives (19.2.4) at once when s 


is a positive integer. If s is a negative integer and ueH,,,, , we can write 


u= Yo Dru, Yello SC lalla 
jelS—s la] —s 
If we commute the derivatives D* through a(x, y,D,) as above and apply 
(19.2.4), we obtain (19.2.4)' also when s is a negative integer. By Corollary 
B.1.6 this proves the estimate (19.2.4) for arbitrary real s. The proof is 
complete. 


If V, and V, are vector bundles on different spaces X, and X, then the 
vector bundle on X, xX, with fiber V,, ®V,,, at (x1,x,) is denoted by 
V, IV, in the following theorem. 


Theorem 19.2.7. Let X and Y be compact C® manifolds, Ey and F, hermitian 
vector bundles on X, Ey and F, hermitian vector bundles on Y, and assume 
that 


peC(T*(X), Hom(x? Ey,x*F,)), qeC(T*(Y), Hom(x? Ey, x¥ F,)) 
are isomorphisms outside compact subsets Ky and Ky. Then 


dx $ yom) = eaniee p(x, 2)*@1 
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is in 
C(T*(X x Y), 
Hom(zx} E, Xin} Ey On} Fy int Fy, 7% Fy int Ey Ont Ey, Kl nt F,)). 
d is an isomorphism outside Ky x Ky, and 
(19.2.5) s-indd = (s-ind p)(s-ind q). 


Proof. With the notation in Theorem 19.1.17 it follows from (19.1.14) and 
(19.1.15) that A is an isomorphism whenever A, or A, is an isomorphism. 
The finite dimensional case of this result shows that d(x, &, y,4) is an isomor- 
phism unless (x,€) € Kx and (y,n) € Ky. Thus d is an isomorphism outside 
Kx x Ky so s-indd is defined. 

When proving (19.2.5) we may assume that p and q are homogeneous of 
degree 1 and C™ outside the zero section, for p and q are homotopic to 
such functions. Then d is also homogeneous of degree 1 but may not be C” 
at (x,é,y,n) unless both € and n are +0. If we choose pseudo-differential 
operators P and Q with principal symbols p and q this corresponds analyti- 
cally to the fact that 


ed 


is not a pseudo-differential operator. (Here P* and Q* are of course the 
pseudo-differential operators which are adjoint to P and Q with respect to 
the scalar products f,(u,v) or f,(u,v) for sections u,v of E @2} or of F@N}. 
Thus they are not the Hilbert space adjoints if P and Q are considered as operators 
in H(,) spaces.) 


Let 1=) ¢j and 1=) yf be partitions of unity in X and in Y sub- 
ordinate to coverings by coordinate patches where the bundles E and F are 
trivial. Let D, be defined as D with P and Q replaced by 


Po=V OPO}, D=LVjQV; 


which have the same principal symbols. Using Lemma 19.2.6 we can find 

pseudo-differential approximations to ¢;¥,D@;¥, for arbitrary j and k. If 

we multiply them left and right by ¢; and y, and sum, we obtain pseudo- 

differential operators D, of order 1 converging to Dy in Y(H,,,H,,_,,) for 

every s, and with principal symbols converging to d. Thus ind D,»=s—indd 

by Theorem 19.2.5. To compute indD, we only have to determine C” 

elements in the kernel and cokernel. Now the calculation which gave 
(19.1.13) also gives 

PX Py@1+1@QF Q, 0 
DaDo= ( ) 
0 Py PF @1+1@Q_ OF 


and a similar formula for D, D$ with PP, replaced by Py P* and so on. It 
follows that (19.1.14) and (19.1.15) remain true, so ind Dy=ind Py ind Qo= 
(s-ind p)(s-ind g), which completes the proof. 
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We shall now discuss a method which in principle can be used to 
compute s-ind p. in two steps: 

(i) Simplification of the underlying space X to a Euclidean space. 

(ii) Explicit calculation of the index for elliptic operators in a Euclidean 
space. 


In this context we shall only consider elliptic operators which are trivial at 
infinity, that is, are defined by multiplication with an invertible matrix 
function outside a compact set. Step (ii) is the subject of Section 19.3. Step 
(i) begins with embedding the manifold X in a Euclidean space: 


Lemma 19.2.8. For any compact C® manifold X there is a C~ embedding 
@: XR’ for some large enough v. 


Proof. We can cover X by compact sets K,, j=1,...,J, contained in coor- 
dinate patches X, with local coordinates x,: X ;>IR", if n is the dimension of 
X. Choose ¢ Ee Co (X ;) equal to 1 in a neighborhood of K,. Then 


®: X3xro(h (x), O1(x) K1 (>), -.., 60), 6)(x) KX) ERO 


is a C® map with injective differential. If xeK, and O(x)=@(y) then ¢,(y) 
=@(x)=1 so yeX, and x,(x)=x,(y). Thus © is injective which proves the 
lemma. 


By composing ® with a generic projection on a 2n+1 dimensional 
subspace it is easy to reduce v to 2n+1, but this is not important here. 


Let N(X) be the normal bundle of the embedding, 
N(X)= (x, y)eX x R”, B(x) y= 0}. 


Then the fibers of N(X) inherit a Euclidean structure from the Euclidean 
metric in IR’, and the implicit function theorem gives that 


{(x, ye N(X); ly] < p} a(x, ye Ox) +y 


is a diffeomorphism on a tubular neighborhood U, of @(X) if p is suf- 
ficiently small. To accomplish step (i) in the plan above we shall generalize 
the product construction in Theorem 19.2.7 so that applied to an elliptic 
pseudo-differential operator in X it yields a pseudo-differential operator in 
U, which can be regarded as an operator in R* which is trivial outside U,,. 
To do this we need first of all an operator of index 1 which can be defined 
in the fibers of N(X) (compactified at infinity to spheres) with reference only 
to the Euclidean structure but is independent of a choice of basis. Thus we 
must define in R* a pseudo-differential operator invariant under the orthog- 
onal group which has index | and, preferably, a very simple kernel and 
cokernel. 

To construct such an operator we consider first in the case u=1 the 


operators 
P P=x+iD:H,—>H, 
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where H,=T?(IR) with norm denoted by || |, and H, consists of all ueH, 
with xuéeH, and DueHo, the norm being defined by 


(jell? + floc ul? + | Dulj?)*. 


If Pu=0 then (x+@)u=0, that is, u(x)=Ce~*”? which is a function in Hy. 
The range of P is dense in H, for if ve H, is orthogonal then (x —0)v=0, so 
v= C’e*’? which cannot be in Hy unless C’=0. Since 


| Pull? = |xull? + Dull? + (xu, éu)+ (du, xu)= |x ull? + || Dull? — |lul|? 


it follows from Proposition 19.1.3 that the range is closed, hence equal to 
H,. In fact, if u; is a bounded sequence in H, such that Pu, is convergent, 
then |jxu,|| and ||Du,|| are bounded so there is a subsequence u,, which is ib 
convergent. If we apply the preceding identity to u,,—u,, it follows that xu,, 
and Du, are also P? convergent. Hence P is a Fredholm operator with 
index 1. 

The analogue of the preceding example in R” is obtained by considering 
the exterior algebra over €” as a complex with exterior multiplication A(w) 
by w=x+4+ié, 

A(w): 09 CA! C"...+A"C"0. 


This is exact when w+0, that is, the range of each map is the kernel of the 
next. It suffices to prove this when w is a basis vector, and then it is clear 
that w only annihilates the forms where w can be factored out. We shall 
split the complex in the manner described in Proposition 19.1.16, after 
introducing the usual hermitian structure in A/C". Thus we write 


AP=QA7(C”),  A?=QA*+1(€"), 
and when u=(up,u,,...)E A® we define 

p(w) u=(A(w) up + A(w)* ug, .. JEA®; 
then p(w)*: A°>A* has a similar form. By the finite dimensional case of 
Proposition 19.1.16 we know that p(w) is bijective for 0+ we”. (In particu- 
lar, the dimensions of A*® and of A° are equal, which follows already from 
the fact that (1—1)"=0.) Let H, now be the space of all forms of even 


degree 
U=Ugtuz+... 


such that f, D;f, x;feL? for all coefficients f and j=1,...,n, and let Hy be 
the space of all forms of odd degree with coefficients in L’. 


Proposition 19.2.9. The first order differential operator p(x+iD) maps H, 
onto H, with one dimensional kernel generated by the scalar function e~'*!"!?, 


Proof. Let qéC™(R7", (A®, A*)) be equal to p(x +ié)~1 outside a compact 
set. Then geS(R~',g) where 


R(x, = tix? +1677, g=(Idxl? +1dE1?)/R*. 
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This follows from the homogeneity. Hence q(x, D) is a continuous map from 
H, to H,, for q(x, D), x;q(x, D) and D, q(x, D) are in Op S(1,g) and therefore 
LP? continuous by Theorem 18.6.3 or just Theorem 18.1.11. From the general 
calculus of pseudodifferential operators in Section 18.5 we obtain, with 
P(x, 6)=p(x +i), 

q(x, D) p(x, D)=1+ K,(x, D) 


where K,&S(i/R,g). If ueSX’ and p(x,D)u=0 it follows that u= 
(—K,(x, D))" u for every N, so ue XY. We also have 


p(x, D) q(x, D)=14+ K,(x, D) 


where K,eS(1/R,g). Thus K,(x,D), x,K,(x,D) and D,K,(x,D) are con- 
tinuous from H, to H, so K,(x,D) is compact from H, to Hy. Hence the 
range of p(x, D): H, >H, is closed and of finite codimension. If v is orthogo- 
nal to the range then v= — K(x, D)* v=(—K,(x, D)*)" v for any N, so ve. 
Thus it only remains to show that the multiples of e~!*!/? are the only 
forms (with coefficients) in Y annihilated by p(x,D) and that p(x, D)* is 
injective on &. This requires a more explicit calculation. 
If f is a q form in R" with coefficients in Y, then 


A(x +iD) f=<x,dx> a ftdf=en*? diel? f), 
We can write 
f=’ fax! 
where J=(j,,...,j,) is a q-tuple of indices between 1 and n, f, is anti- 


symmetric in the indices, and )” denotes summation over increasing indices 
only. Then we have 


A(x +i D)f =YY" (0;+x)) fydxjadx’, 
75 
Ae + ED) SI? =LY (+x) fC t fd ett 
pld,h 
where e/7=0 unless j¢J, I€L and {j}UJ={NUL; in that case eff is the sign 
iJ 
of the permutation ();). We shall rearrange the terms in the sum. First 


consider those with j=/. For these we must have J=L and j¢J if ef? +0, so 
the sum is 


YY Wes +x)) fill? 
jéd 


Next consider the terms with j+/. If e/7+0 we must then have /eJ and jeL, 
and deletion of | from J or j from L gives the same multi-index K. Since 


iJ — of odlK pljK — _ oJ piK 
frp = Ej “jx fr, = — EK EL > 


the sum of these terms is 


-¥y ((0; +X) fis (Cr +X) fix). 


J*UK 
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A simple calculation which we leave for the reader. gives 
| A(x +i D)*f |? =) YY (01-6) fis 6) - 6) Six). 
dL K 
Since 
[(x;+6,),(x,— 6) J=26 , 
we obtain after an integration by parts 


LAG Dy Ace =) jt O) A? +24 If 1. 
Jj Wl=a@ 


Using (19.1.11) we conclude that if u=> Ug, where wu, is a 2q form with 
coefficients in Y then 


p(x +iD)ull>=9)4q|ur,ll? +> Me; +0) ull’. 
q J 


Thus p(x +iD)u=0 implies that u is a multiple of e~'*!"/?. Similarly we have 
if v=)\v2,,1 where v,,,, isa 2q+1 form with coefficients in Y 
[|p +iD)* oll? =) (4g +2) |lo94 Ml? +d Mj +4) oll?. 
q J 


Thus p(x+iD)* is injective which completes the proof. 


Remark. The proof of Theorem 15.1.1 is essentially the case q=1 of the 
calculation above. 


To continue our program we must modify p(x+iD) to a pseudo-differ- 
ential operator of order 0. 


Lemma 19.2.10. If T,(x, €)=(1 +e x|? +e €|?)-* then T,(x, D) is for small e>0 
an isomorphism of I? on the Hilbert space B of all uel? with DjueL’ and 
x,ueL? for j=1,...,n, and T,{x,D)~' ff in I? as 20 if feB. 


Proof. Let R,=1/T, and introduce the metrics 
B= 8 (dx|? + 1d 2]? R(x, 2) 


which are uniformly o temperate when 0<e<1. It is obvious that R, and T, 
are uniformly bounded in S(R,,g,) and S(T,,g,) respectively, for this is true 


when eé= 1. Hence 
R,(x, D) T(x, D)=1+ K , (x, D) 


where K,, is uniformly bounded in S(e?/R?,g,). Thus the operator norm in 
L’ is O(c”), so 1+.K,,(x, D) is invertible in 2 for small e. Similarly 

T,(x, D)R,(x, D)=I+ K,,(x, D) 
where K,, is bounded in S(e?/R?,¢,). Hence the I? operator norm of 


K,,,[x;,K ,] and [D,,K,,] are O(c’), so the operator norm of K,, in B is 
O(c”). Thus 1+K,, is invertible in B for small ¢, so T,(x,D) is an isomor- 
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phism [?—B and R,(x, D) is an isomorphism B— I?. We have 
T,(x, D)“*f=R,(x, D)(I+K,,(%,D))""f,  feB, 
and (1+ K,,(x, D))-'f—/f in B as «0. Finally, when ge then 
|R,(x, D) g||? =(R,(x, D)* R,(x, D)g, g) 
= ||gl|* +27) Ix, gil? + |D;gll7) + (K3.(x, D)g, g) 


where K;, is uniformly bounded in S(e?,g,) so the norm is O(e*). Hence 
R,(x, D) has uniformly bounded norm as operator from B to L’. If ge¥ then 
R,(x,D)g>g in ¥ as €>0 since R, 1 in C™ and R, is uniformly bounded 
in S(R,,g,). Thus 


T,(x, D)~" f=R,(x, D) f+ R(x, D) (I+ K2,(x, D))"' f—f)>f 


in I? as «0, which completes the proof. 


Fix ¢ now so small that 7,(x, D) is an isomorphism and 
(T,(x, D)~* e~ #772, e F772) 50, 


Then p,(x, D)=p(x, D) T,(x,D) is a surjective Fredholm operator in 
F((R", A*), 2(R",A°)) with index 1 which is injective in the orthogonal 
plane of e~*!"/2, Clearly 


p(x, 6) = p(x, &) T.(x, 0) - i) Op (x, 6)/0 E OT, (x, 2)/6x;. 


To deform p, to a pseudo-differential operator which is trivial at infinity we 
choose dE CP (R") and weC™(IR") as decreasing functions of the radius so 
that 


O(x)=W(x)=1 when |x/<1; 

o(x)=0, w(x)=1/|x| when |x|>2. 
Lemma 19.2.11. If aeS(1, g) then 
(19.2.6) a(x, €)=a(x, (6x) W(6¢) ¢) 
is uniformly bounded in S(1,G) for 0S6 <1 if 

G=|dx|?/(1 +|x]?)+1d 17/0 +141’). 

We have a;(x, €)=a(x,0) if |dx|>2, and as(x, €)= a(x, 6(4x) €/\6€)) if [dE] > 2. 
Proof. The statement means that 

[Dg Df asx, SS Cy g(1 +16) IL + xl). 


In view of the homogeneity in € when |6é|22 we may assume that |6¢é| <2 
in the proof. First we assume that |6x|/ <1. Then a,(x, 2)=a(x, w(6€) é), and 


IDEW(SHESC,S" 1+ pe) MS ca +lEyiM, «#0; 
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it suffices to prove the first estimate when 6=1 and then it follows from the 
fact that w(¢) &€ is smooth and homogeneous of degree 0 for large {é|. Since 
1+ [x] +W(6€)|é[/ 21+ |x}+c|é| the terms in DE DBas(x, €) where a has been 
differentiated j+|6| times can be estimated by a constant times 


(1+ +16) FIL + [eyo s CH + fx) Pl + ie 


as claimed. When |5x|>2 we have a,(x, €)=a(x,0). The assertion is obvious 
then so we now assume that 1<|éx|<2. We can then estimate a derivative 
of a of order j by Cé/, and 


[Df Dz (5x) (58) 6E|S Cy g(1 + xl) A + 1G) 


since this is true when 6=1 and 6/14 |6x|)<1/(1+|x|). The required es- 
timate is an immediate consequence. 


We have now developed all the tools required to construct the operator 
to be used in the generalized product formula: 


Theorem 19.2.12. There exists a symbol BeS$,,(R" xR") with values in 
L£(A*(R"), A°(IR")) such that 
(i) B(x, €)= A(x/|x|) + A(x/|x|)* if |x| is sufficiently large. 

(ii) The kernel of B(x, D) minus B(x,0)6(x— y) has compact support. 

(iii) B(x,D) maps I7(R", A°(0R")) onto L7(IR", A°(IR")) with one dimensional 
kernel < CP? (R", A°(R")) consisting of rotationally symmetric scalar functions. 

(iv) O* B(x, D)u= B(x, D)O*u if O is an orthogonal transformation in IR" 
and ue (IR", A“(R")). 


Proof. Choose ¢ so small that the conclusions of Lemma 19.2.10 are valid 
and the kernel of p,(x,D) is not orthogonal to e~'!’/?. Next we apply 
Lemma 19.2.11 to a=p, and to b=p;' outside a compact subset M 
={(x,2);|x|SC,|44SC} of IR". Then a,(x,é)b,(x,é) is the identity if 
(x,O€M and 26CK1. In fact, if (x, d(6x)W(6H HEM then |dx}/Sd6CK1, 
hence W(6€)|dé|S 5 C S54 so [6E|<1 and |é| SC. Now 


bs(x, D) as(x, D) =I te K, (x, D), 
s(x, D) (x, D)= I + K7(x, D) 


where K,, are bounded in S((1+|x|)~*(1+]é])"',G) for small 6 if G is 
defined as in Lemma 19.2.11. Hence it follows from Theorem 19.1.10 that 
inda,(x,D)=1 and that dimKera,(x,D)=1 if 6 is small enough. If 
u;éKera,(x,D) and |ju;|=1 then u,;=—K,,u, belongs to a fixed compact 
subset of 7. Any limit point u, when 6-0 is in the kernel of p,(x,D) and 
has norm 1, and is therefore not orthogonal to e~'*!"/?. If dO is the Haar 
measure in the orthogonal group and 


U,;={0*u;dO 
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then U, is also in the kernel of a,(x,D) and has the same scalar product with 
e~!*!/2 as u., so U,+0 for small 6. Thus we have a generator for the kernel 
which is invariant under the orthogonal group, which means that it is a 
rotationally symmetric function. 

The kernel K of a,(x, D) is as(x,0)6(x — y) if |6x|>2. Now 


K(x, y= Qn)" fel” a,(x, C)dé 


is rapidly decreasing when |6x|<2 and yoo (see the proof of Theorem 
7.1.22). It follows that 


(K(x, y)— a(x, 0) d(x—y)(1—(y y)) > 0 
in ¥ as y-+0, if @ is defined as in Lemma 19.2.11. Hence the operator 
Bo(x, D) = a,(x, D) —(as(x, D) — a5(x,0)) (1 — P(y x)) 


which clearly has the property (ii) will also satisfy (iii) if y is small enough. 
When |6x|>2 we have 


B(x, 6) = p(x, 0) T,(x, 0)—i ¥ A p(x, 0)/0 € , OT, (x, 0)/0x; 
= p(x, O)/Jex{+OU/\x|) as xo. 


If 6 is a sufficiently small positive number then 


B(x, 6) = (0 x) (1 +67(|x]? + b(5x)?/57))? ([x1? + (5x)?)” * Bo(x, 6) 
+(1 — (8x) p(x, O)/|x1 


will remain a function of x with invertible values if |6x|>2, so (i) is fulfilled 
and B inherits properties (ii), (iii) from By. Rotational symmetry has been 
respected in the entire construction so (iv) is valid, which completes the 
proof. Note that the factor of B, above was chosen so that the principal 
symbol is 


(i) p(x +4 G(5.x) £/|6 EI) (Ix1? + G(5x)?)-*. 


The operator B(x, D) is convenient to use analytically, and the symbol is 
homotopic to p(x+ié) outside a compact set. We shall call B the Bott 
operator since the symbol is a generator in the Bott periodicity theorem 
which will not be used here though. For technical reasons it is preferable to 
regard B as an operator in S"=R"U{oo}, with the local coordinates x f\x\? 
at infinity. B maps sections of the trivial bundle E,=S" x A°(€") to sections 
of the bundle F, which is equal to IR” x A°(C") over IR" and to (S"~ {0}) 
x A*(€") over S"~ {0}, with (x, p(x/|x|)w) and (x, w) identified if xeIR"~ {0} 
and weA*°(€"). With the first representation the definition of Bu is un- 
changed if ueé’(IR"), and with the second one the operator B acts as the 
identity if suppu is sufficiently close to oo. The compatibility of these 
definitions follows from condition (ii) in Theorem 19.2.12. 

We are now prepared for the proof of an index formula for the fiber 
products which occur in connection with an embedding. The statement is 
long but one should note the analogy with Theorem 19.2.7. 
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Theorem 19.2.13. Let Y be a compact C~ manifold, Ey and F, two complex 
C™ hermitian vector bundles over Y, and let 


qe C(T*(Y), Hom(a} Ey, nf Fy)) 


be an isomorphism outside a compact set. Let V be a real vector bundle of 
fiber dimension n over Y, endowed with a Euclidean structure. Let V, be the 
sphere obtained by adding a point at infi inity to V,, and let V be the sphere 
bundle over Y with these fibers. Thus V=VU(Yx foo}) with the C™ structure 
at infinity defined so that the inversion V,~ {0}Svrov/|v|? extends to a diffeo- 
morphism V\.V°>V if V°=Yx {0} is the zero section of V. Denote by Ve the 
complexification of the vector bundle V, and let Ey be the lifting from Y to V 
of the vector bundle A*(V¢). Let Fs be the vector bundle on V obtained from 
the lifting of A°(V¢) to V and the lifting of A°(Ve) to VN V°® by identification 
of (v, p(v/|v|) w) and (», w) if ve V,~ {0} and we A(V,g). Here p(v)= A(v)+ A(v)* 
as above. Let Ey, F, be the bundles Ey, F, lifted to V, and choose 


ge C(T*(V), Hom(ay Ey, nF Fy) 


so that with x denoting the projection V-> Y 


(19.2.7) G(t,x* n)=q(nt,n) if teV and neT#(Y). 
Now introduce for (t,t)eT*(V), € denoting the restriction of t to the fiber, 
deja (PPEH OOO! Oa) 
, 1@q(t,t) w(x) p(x+ic d(x)* @! 


where x is t regarded as an element of V,, and w(x), @(x) are decreasing 
continuous functions of |x| which are equal to 1 at 0 and equal to 1/|x| resp. 0 
in a neighborhood of «. Then 


de C(T*(V), Hom(nj E,® Ey ® Fz® Fy, ny F,@ Ey ®E,@ Fy) 
is an isomorphism outside a compact set, and 


(19.2.8) s-indd = s-ind q. 


Proof. First we recall that (19.1.12) is an isomorphism whenever A, or A, is 
an isomorphism. Now w(x) p(x+ié@(x)) is an isomorphism unless x= ¢=0, 
thus reV° and ft is orthogonal to the fiber V,,. But that means that t=2* 7 
for some ne T,*(Y), so it follows in view of (19.2.7) that d is an isomorphism 
outside a compact set. Thus s-indd is defined and is independent of the 
choice of @ satisfying (19.2.7) and also of the choice of @ and of w. When 
proving (19.2.8) we can also assume that q is homogeneous of degree 1 and 
C” outside the zero section of T*(Y). Choose Qe PY Ey @ 3, F, @ 9?) 
with principal symbol q. To change the Bott operator B to a first order 
operator we choose a pseudo-differential operator T in S” such that the 
principal symbol at a cotangent vector (x, €) is its length with respect to the 
standard arc length in S". (This is |dx|7/(1+|x{|?)? in IR" identified with 
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S"~ {co} so the principal symbol of T restricted to T*(IR") is |€{(1+|x|7).) 
Adding a large positive constant to T we can make T invertible, hence of 
index 0. The operator T also acts on sections of the trivial bundle E, on S$”, 
and B,=BT is now a first order elliptic pseudo-differential operator with 
kernel generated by a rotationally symmetric scalar function if T is chosen 
invariant under the orthogonal group. It is clear that B, also acts on 
sections of E,® Ey @Q? for they can locally be considered as forms in R" 
depending on the parameter y and with values in E,@Q}, and B, is 
invariant under the orthogonal group. (In the spherical fibers we identify 
functions with densities.) 

As in the proof of Theorem 19.2.7 we take a partition of unity 1=) yf 
subordinate to a covering of Y with coordinate patches Y, where Ey and Fy 
are trivial and V with its Euclidean structure is equivalent to Y,x IR". In 
terms of these trivializations ¥,Qw, defines an operator from sections of 
E,®E,@Q} to sections of E,® Fy @Q} which can be approximated by 
means of Lemma 19.2.6. After multiplication left and right by w, we can 
pull the operator back to the manifold. Summation over k gives an operator 
O= YO, from sections of E p@Ey @ 23 to sections of E p@F, @ 2 such 
that 


(19.2.9) O11 @n* u)=!1 @n* Qou, 


if u is a section of Ey@Q} on Y. Here Qp>=) WZ Qw? is another pseudo- 
differential operator in Y with principal symbol g. When we apply Lemma 
19.2.6 to W,Qy, we obtain pseudo-differential operators converging to O, 
with principal symbols converging to wi q(xt,n) at (t,2* 7). (It is extended 
from there so that it is constant in the directions conormal to the plane x 
=constant defined in terms of the local trivializations, but this information 
does not survive when we form Q.) Thus the principal symbols of the 
operators approximating O converge to some @ satisfying (19.2.7). In the 
same way we can also lift Q to an operator Q, from sections of 
F,® Ey@Q} to sections of F,@F,@Q}, and we have the intertwining 
properties 


(19.2.10) on B,=B, 0, BY 0, =OBF, 
since they are true for the local constructions. 
Now consider the operator 
D= ie - | 
Q BF 
from sections of Ez, @Ey® a OF, @Fy@ oF to sections of F, @ Ey @ ai OE, 
@F,@Q}. Just as in the proof of Theorem 19.2.7 we can now apply 
Theorem 19.2.5 and conclude that D is a Fredholm operator whose index 
can be computed on smooth sections, for the limiting principal symbol 


—A* 
(19.2.11) es = 
q b, 
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is an isomorphism outside the zero section of T*(V). In fact, the principal 
symbol b, of B, is an isomorphism at (¢,t) except when t is conormal to Vor, 
and then it follows from (19.2.7) that G(t,t) is an isomorphism, when +t +0. 
From the intertwining property (19.2.10) it follows that 


BY B,+0*O 0 

0 B, BR+Q, QF) 
Here B, B* is injective, and the elements in the kernel of B*B, can be 
written in the form uo(|x|)@v(zy) where uy is a scalar function of the 


Euclidean metric and v is a section of E,@Q%. But the proof of (19.2.9) 
gives 


| 


Oly @1* v)=uyp @n* Qov, 
so the kernel of D is isomorphic to that of Q,. Similarly, 
B, B¥+Q*Q, 0 
0 Bt B,+00*)" 
Here B, Bf is injective so we see as above that the kernel consists of all 
Uy ®n* v, where v is a C™ section of F,®@Q3, annihilated by O*. Since O 
preserves the degree of forms along the fibers of V this means that uy @n*v 


must be orthogonal to Ow for every section w of Fy @ 3 regarded as a 
subbundle of E pF @. If j denotes integration along the fibers then 


fuy Ow=Q, J uy w 


by the definition of QO. (Cf. (19.2.9).) Hence the condition on v is that v is 
orthogonal to the range of Qo, that is, Q% v=0, so we have 


pp*=( 


ind D = dim Ker Qo — dim Coker Qp = ind Qp = s-indq. 


It remains to establish a homotopy between the symbol (19.2.11) of D 
and d(t,t). The construction of B, gives 


by =|d1, P(x +8 6(6x) Ed EI AIxI? + P(Sx)")7, 


wjere 6 is obtained from Lemma 19.2.11 and |é|,=(1 +|x|?)|é| is the length 
of € with respect to the spherical Riemannian metric. By the homogeneity 
of p this is equal to 


P(x|E], +4 G (5x) (1 +|x17) €/5)/(lx|? + G(5x)’)?. 


In d we have at the corresponding position W(x) p(x +i o(6x)¢). For O<A<1 
a connecting homotopy of isomorphisms for +0 is given by 


PULA) [El +A) x +EG(Sx) EL + 1179/9)! VA WO) + (1 — AMX? + 6 (6)7)*). 


This completes the proof. 
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One can also make d quite trivial in a neighborhood of infinity by 
considering for O0<6<1 the homotopy 


4 ten ee ks — $(6x) I @ q(t, r)* 
meme (6x) I @ Gt,7) W(x) p(x +ic O(x))* QI)” 


This is still an isomorphism when x+0 or +0, and when x=0 we have 
changed nothing. Thus s-ind d = s-ind d, = s-ind dp. Note that in the neighborhood 
of co in V where ¢(x) = 0 we have 


_ppoxv@r 0 
dol ee pccixpteot) 


As in Theorem 19.2.3 we can choose a pseudo-differential operator of order 
0 with a homotopic principal symbol such that the kernel of the operator 
differs from the kernel of a multiplication operator by a kernel of compact 
support in Vx V. This means that the kernel and cokernel of the operator 
can be computed on sections with compact support in VY, so the extension 
by the section at infinity is no longer relevant. To simplify the operator 
further we shall trivialize the bundles: 


Lemma 19.2.14. If E is a C® complex vector bundle over a compact C” 
manifold X, then one can find another such bundle G such that E®G is 
isomorphic to X x C% for some N. 


Proof. This is very close to the proof of Lemma 19.2.8. Choose a covering of 
X by coordinate patches X,, j=1,...,J, such that for every j there is a 
trivialization ,: E\y, > X,x C’. If ¢,¢Cy(X,) and }'¢,=1 then 


Ese->(ne, Wi Py €,..-,WyPyejex x cy, 


where x is the projection E— X, is an embedding of E as a subbundle of X 
x €", N=Jv. Now we just have to take for G the bundle for which G, is 
the orthogonal space of the image of E, in C*. The lemma is proved. 


If we apply the lemma with X replaced by V and E replaced by 
E,@ Ey @F,@ Ff, it follows that the direct sum of the identity in C*(V,G) 
and the operator constructed just before Lemma 19.2.14 is an elliptic opera- 
tor D, on sections of the trivial bundle Vx €* which is just multiplication 
by a vector bundle isomorphism in a neighborhood of oo. 

Suppose now that V is the normal bundle of an embedding of Y in R’ 
for some v. As observed after Lemma 19.2.8 we can then identify V with a 
tubular neighborhood U, of Y, and D, defines an elliptic pseudo-differential 
operator on U, with symbol dg wich is just a bundle isomorphism on 
U,~ U, for some r<p. For a fixed 2+0 in RR” the symbol d,(X, 4) gives an 
isomorphism of ©’ and the fiber of the range bundle at X, so this is also a 
trivial bundle. Thus D, is a NxN matrix of pseudo-differential operators 
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reducing to multiplication by an invertible matrix function in U,~ U. for 
some r<p. To extend it to all of R” we need the following lemma: 


Lemma 19.2.15. Let UCR” be open and bounded, let K <U be compact and 
aeC”~(U~ K,GL(N, €)). Then one can find : 


A,eC®(U,GL(N, ©), A,,¢C*(R’S K, GL(N, C)) 


such that A,,=aA, in UXK and A,, is homogeneous of degree 0 in a 
neighborhood of ©. 


Proof. Choose ¢eCZ(U) with OS <1 and ¢=1 in a neighborhood of K. 
Then 
E={(x,2,,2,)ER” x (C* ®C"); a(x)(1— (x) 2, = b(x) zy 
if xeU, z,=0 if x€U} 


is a vector bundle over R’, hence trivial. The proof goes as follows. If 
4<t<1 then the orthogonal projection from the fiber E,.< C? to E,, is the 
identity map for large x or t=1, hence an isomorphism for all x if 1—t is 
small. This defines an isomorphism P(x): E,-E,, depending continuously 
on x. If we fix a large ball B containing supp ¢ then 


P(t"! x)... P(x): E, > Ex, 


trivializes E over B if k is so large that E is trivial over t* B. Thus we can 
find C® maps F and G from B to Nx N matrices such that E restricted to 
= {(x, F(x) w, G(x) w); xe B, we C%}. 
Thus a(x) (1—(x)) F(x) = $(x) G(x) if 0< <1. Set 
Ao(x)=F(x)/@(x) when (x)>0; 
A,(x)=G(x)(1—¢(x)) when ¢(x)<1. 


Then a(x) Ap(x)=A,,(x) when 0<¢(x)<1 so we can extend A, to U by 
defining A,(x)=a(x)~'A,.(x) when $(x)=0 and extend A,, to [K by defin- 
ing A,,(x)=a(x)A,(x) when (x)=1. Then aA,=A,, in UXK. Since U is 
bounded we can take B={xelR’; |x| <2R} with R so large that |x|<R/2 in 
U. With w as in Lemma 19.2.11 we may change the definition of A,,(x) to 
A,,(/(x/R)x) to achieve the desired homogeneity, and this completes the 
proof. 


Now we just replace our operator D, by Dg A, and obtain an operator 
acting on © valued functions which is just multiplication by A,, outside a 
compact subset of U,. Thus it can be extended to all of R”. Summing up, 
starting from an elliptic symbol g in Y we have defined an elliptic Nx N 
system of pseudodifferential operators in IR” which outside a compact subset 
of R” is just multiplication by an invertible matrix D(x), homogeneous of 
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degree 0. For this operator acting on functions with compact support the 
index is equal to s-indg. In Section 19.3 we shall determine the index of 
such operators explicitly. This will in principle yield a method for comput- 
ing s-indq. References to completely explicit formulas for s-indq will be given 
in the notes, but it would take us too far from the main path to develop these 
fully here. 


19.3. The Index Theorem in IR" 


As in Section 19.2 we shall denote by g the o temperate metric in R*” 
defined by 


Bx2=h(x, )(\dxi? +\de)7), h(x, G)=(1 + |x)? +12)". 


The symmetry in the x and € variables and the use of the Weyl calculus 
simplifies the proof of the following theorem: 


Theorem 19.3.1. If aeS(1,g) has values in L(C’,C”) such that a~' exists and 
is bounded outside an open ball BcIR?" then a”(x,D) is a Fredholm operator 
in 7 (R",C”) with index 


(19.3.1) inda”(x, D)= —(—2ni)-"(n—1)!(2n—-1)! J Tran? day", 
OB 


if R?" is oriented by dx, Adé, A... \dx,Adé,>0. 


Here a~'da is a one form with coefficients valued in v x v matrices, thus 
non-commuting. If n=v=1 then a is a scalar function and the right-hand 


side is 
(2ni)~*  da/a, 
2B 


that is, the winding number of a considered as a map from 6B to C~ {0}. 
(See the example after Theorem 19.2.4.) To prove Theorem 19.3.1 we shall 
use the trace formula in Proposition 19.1.14. First recall that A is a Hilbert- 
Schmidt operator in 7(IR") if and only if the kernel «/ of A is square 
integrable, and that 

JAIZ=Jlo(x, yl? dxdy. 


If A=a™(x,D) this means by Parseval’s formula and (18.5.4) that acl? and 
that 
la" (x, D5 =(22)~" Sf lalx, OI? dx dé. 


(This holds for a(x, D) as well.) If 7, /,¢17(R") then the kernel of of the 
composition A of the corresponding operators is given by 


A(x, y)= | A, (x, z) H,(z, y) dz, 
and the trace of A is [{ ,(x, z) H(z, x)dxdz. Now 
A(x,x+y)= fA, (x, 2) X,(z,x+y)dz 
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is by Fubini’s theorem in L’ as a function of x and is a continuous function 
of y with values in L! since 


ft,(z,x+y)—,(z,x+y))? dxdz 


tends to 0 with y—y’. Hence the trace of A is { o(x,x)dx if the kernel of 
A is continuous. This is true if A4=a™(x, D) is of trace class and the symbol 
is integrable, so then we have 


Tr(A)=(22)-" ff a(x, &)dxdé. 


If the values of a are vx v matrices then A is an operator in -(IR",C’) and. 
the formula remains valid with a replaced by the trace of a as a matrix in 
the right-hand side. We leave the verification as an exercise and pass to a 
very elementary sufficient condition for an operator to be of trace class. 


Lemma 19.3.2. a”(x, D) is of trace class in I7(IR") and 
(19.3.2) la”*(x, D)||, $C bat Ix" EF D® DE alle 


Jaj+...+ |B] Sn41 


if the right-hand side is finite. 


Proof. From the operator calculus we obtain 
h(x, D) (1 +|x|? +|D|?) +c"(x, D)= 1 


where c= Ah/4eS(h?,g) is also a function of |x|*+ |é|?. Hence 1+|x|?+|Dj? 
commutes with h” and c” since the Poisson bracket of functions of |x|? +[é|? 
is 0. Raising the identity to the power k where 2k2=n+1 we obtain 

i 


> bY (x, D) (1 + |x|? +|D)?¥=1 
0 


where b?(x,D) is a sum of products of j factors h”(x,D) and k—j factors 
c”(x,D) in some order, hence b,eS(h7*~/,g). If we multiply to the right by 
a”(x,D) and multiply the largest possible number of factors x and D less 
than or equal to n+1 into a”(x, D) then we obtain with summation for |a’} 
+...+|p'|sn+1 
a” (x, D) =) OY a’, pr, prl% D) aya’, p’,p(% D), 
where 
de oy plG Oa CODe De dae) 

while by 4,p-,5 is independent of a and is in S(h”*+1’?, g) since at most 2k 
—(n+1) factors x and D will be left as right factors of b,, and b,eS(h*, g). 


Thus 62. «,g,g” 18 a Hilbert-Schmidt operator which completes the proof of 
the lemma. 


Proof of Theorem 19.3.1. Choose weC™(IR?") so that 1—w has compact 
support and a is invertible in suppy. Set b=ya™', defined as 0 outside 
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suppw. Then beS(1,g) and ba=ab=wWI where I is the identity matrix. 
Hence 


b”’(x, D)a”(x, D)=1—RY(x,D), a(x, D) b” (x, D)=1 — R¥(x, D) 


where I is now the identity in H=I7(R",C’) and R j©S(h, g). The symbol of 
RY (x, D)® is therefore in S(h%,g) so it follows from Lemma 19.3.2 that 
R(x, DY is of trace class if N2n+1. Using Proposition 19.1.14 we con- 
clude that a”(x, D) is a Fredholm operator in H and that 


(19.3.3) ind aY (x, D) = Tr(RY (x, D)Y — RY (x, D)") 


To derive an explicit formula like (19.3.1) from (19.3.3) we must place 
ourselves in a situation where the asymptotic expansions of the calculus of 
pseudo-differential operators give a precise result in the whole space. This 
can be done by considering for 0<e<1 the symbols 


a,(x,C)=alex,e¢), (x, ¢)=b(ex, € ¢) 
which are uniformly bounded in S(1,g,) if 
g.=hx, 6) (ldx|? +|dél?), h(x, d=? Ale x, ¢ €). 
Note that g,<g since h,<h. If as before we define 
RY (x, D)=1—b?(x, D) af’ (x, D), %) (x, D)=I— a(x, D) b*(x, D) 


then R,,;-(1—w,)I is uniformly bounded in S(h,g) when 0<eS1 if y,(x, ¢) 
=w(ex,eé). When ¢ is bounded away from 0 then 1—y, also has such a 
bound so it follows that the symbol of R¥(x, D)* is uniformly bounded in 
S(h®, g) and a continuous function of ¢ with values in C°(R2"). Since 


(19.3.3) ind aY (x, D) = Tr(R% (x, DY" — R%(x, DJ") 


and the right-hand side is a continuous function of ¢ by Lemma 19.3.2 while 
the left-hand side is an integer, it follows that ai’(x, D) and a(x, D) have the 
same index for every e€(0,1). (This is also an easy consequence of Theorem 
19.1.10.) 

For 0<e<1 we now introduce 


C¥¥(x, D)= R¥ (x, D)® — R%(x, Dy’. 
If we compute the right-hand side by means of the asymptotic formulas in 


Theorem 18.5.4, breaking off where the error term becomes bounded in 
S(h®, g,), then the trace of the error term is bounded by 


CifatdxdE=Ce?®—" [{hXdxdé>0 as 0. 


(Recall that N>n.) If S, is the finite sum of terms kept in the asymptotic 
expansion of C, it follows that 


(19.3.4) ind a(x, D)=lim (22)-" [[ Tr(S,(x, ©) dx dé, 
e>0 
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where Tr is now just the trace of a matrix. Every term in S, is a product 
where every other factor is a derivative (possibly of order 0) of a, and every 
other factor is a derivative of b,. If we introduce ex and e€ as new 
integration variables in (19.3.4) then every term containing y differentiations 
in all becomes a constant times e“~?". Hence the contributions to the 
integral when ~<2n must add up to 0 and 


(19.3.4y ind a(x, D)=(2)~" [ Tr(S2(x, )) dx dé, 


where S° consists of the terms in S, where exactly 2n differentiations occur. 
The right-hand side is independent of ¢ so we can take e=1 and write S° 
instead of S° in what follows. 


If TeGL(2n,IR), the group of invertible linear transformations in R2", 
and A,=aocT, then A,eS(1,g) so the preceding discussion can be applied 
also to A,. It follows that ind A*(x, D) is a continuous function of T, hence 
constant in each component of GL(2n, IR). Thus 


ind A¥(x,D)=inda”{x,D) if det T>0. 


In particular we can apply this to diagonal matrices with positive diagonal 
elements, 


T(x, €)=(€1 X4,---58 n Xqr4 CigeengO4 6): 


If we replace a and b by the composition with T and then introduce ¢;x, 
and 6;¢; as new integration variables, we find that the contribution to 
ind a”(x, D) from a term in S° containing differentiations of order o,,...,,, 


B,,---5B, With respect to x;,...,X,, 1,-.-5€, is multiplied by e%7!... 68> 
Hence 


(19.3.5) ind a”(x, D)=(2n)-"{{ Tr(S'(x, d))dxd€, 


if S* denotes the terms in S° containing precisely one derivative with respect 
to each variable x,,...,X45€45-++5 En 

An immediate consequence of (19.3.5) is that the index of the operator 
A¥(x,D) above is equal to inda”(x,D) times the sign of detT for every 
TeEGL(2n,R). In fact, (19.3.5) proves this for arbitrary diagonal matrices, 
and the assertion has already been proved when det T>0. In particular we 
can apply the result when T is the linear transformation corresponding to 
an arbitrary permutation of the x, € variables. 

Returning to the integration variables T(x,¢) we conclude that if 
Si(t,x,é) is obtained from S*(x,é) by making the permutation rt of the 
differentiations 6/0x, and 0/0¢, then 


sgn tind a”(x, D)=(2x)-"{{Tr(S'(t,x, ))dxdé. 
Hence it follows that 


(19.3.6) ind a”(x, D)=(2n)-" {[ Tr(S2(x, Q)dxdE((2n)! 
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where 
S?(x, )=Y sent $(t,x, 8). 


The alternation which occurs in S* eliminates all terms containing a factor 
differentiated more than once, for only sgn is changed if we exchange the 
corresponding differentiations. Thus S? is a sum of products of derivatives 
of a and b of order less than or equal to 1, a derivative with respect to each 
variable occurring exactly once. To compute S? we can now return to the 
calculus formulas discarding all terms containing a derivative of order 
greater than 1. In the expansion of the symbol of b”(x,D)a”(x,D) it is 
therefore sufficient to keep the terms 


ba—i{b,a}/2=wI—i{b, a}/2, 
and in the expansion of the symbol of a”(x, D)b”(x, D) we just keep 
ab—if{a,b}/2=w wl —i{a, b}/2. 
Note that since matrices do not commute, the Poisson bracket 
(19.3.7) {b,a} =) (@b/0€ ,0.a/0x;—6b/éx,0a/0€)) 


does not only differ in sign from {a,b}. In the symbol series for C, we shall 
never keep terms where {b,a} or {a,b} is differentiated. The terms we keep 
will therefore be products of factors {b,a} or {a,b} and terms from the 
expansion of the symbol of a power of w(x, D), for all factors involving w 
can be moved together since w is a scalar. However, since {w,y}=0 there 
. are no such terms where yw is differentiated only once. It follows that in C, 
we need only consider the terms 2 


(1 —W) I +i fb, a}/2)" — (1 —) I —ifa, b}/2)". 


The terms where 2n derivatives occur are 


(7) avo" na72yr,0}"— (4,5) 


If we introduce the explicit definition (19.3.7) of the Poisson bracket we find 
that {b,a}" contains 2"n! terms where precisely one derivative occurs with 
respect to each variable. They occur with the same sign as coefficients of 
d&,Adx,a...adé,Adx, in (dbada)". If we perform the permutations ¢, 
multiply by sgnt and sum then all terms are obtained once. This proves 
that 


Sab pAaX; Kons nde, adx,=nii"(™) (1—w)*— "(db a da)" —(daa db)’). 
Hence we obtain 


(19.3.8) ind a”(x, D)=(n!/(2n)!)(i/22)" {20 —wyr" (") Tr(db a da)" 
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where the integration is made over IR?” with the orientation given by 
(19.3.9) dE, Adx,a...adé,adx,>0. 
Here we have used the fact that 

Tr(daadb)"= —Tr(db ~ da)’, 


since the trace of a product of matrices is invariant under circular per- 
mutation whereas moving an exterior one form from the right to the left 
through a 2n—1 form will change the sign. 

Now we introduce the definition b= wa 


db=(d)a~'—wa-'(da)a“', dbada=(dW)na~' da—wW(a"' da)’. 


~1 and 


When forming the n'® power we can only use one factor dw since di (anti-) 
commutes with the other factors and dy adi =0. Moreover 


(19.3.10) Tr(a~! da)’"=0, 
for moving a factor a~'da from the extreme right to the extreme left 
introduces a factor —1 without changing this quantity. By the invariance of 
the trace under circular permutations of factors of even exterior degree, we 
conclude that 

Tr(dba day=nw"! dy(—-1)l"! aATr(a—* day"). 


The form Tr(a~! da)*"—! is closed by (19.3.10). If we set 
i (N 
FT ( )a—s)"nsttas 
9 \n 


N 
we obtain F()=( ) mBEN—n+1,n)=1 (see (3.4.9)), hence by Stokes’ for- 
mula ” 


(19.3.1) ind a”(x, D)= —(2mi)~"(n—1) !/(2n—1)! | Tr(a da)?" 
6B 


where B is a ball so large that a(x, €) is invertible in its exterior. We recall 
that IR?” has here been oriented by (19.3.9). Choosing the standard orien- 
tation used in (19.3.1) introduces a factor (—1)" so (19.3.1) and (19.3.1) are 
equivalent. The proof is complete. 


The proof of (19.3.10) shows that the differential form Tr(A~1dA)*" on 
GL(v, C) is equal to 0, hence that Tr(A~1dA)?"~* is closed. This implies that 
the integral in (19.3.1) only depends on the homotopy class of the map 
0B—GL(, C) defined by a. We shall use this homotopy invariance together 
with the stability properties of the index to extend Theorem 19.3.1 to sym- 
bols in S(1, G) where 


(19.3.11) G=|dx|?/(1 +[x]?)+|d E71 +4”). 


Lemma 19.3.3. Let weCZ(IR2") be a decreasing function of r=(\|x\? +|é|’)? 
which is 1 when rS1 and i/r when r22, and set W,(x,O=wWlex,eo). If 
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aeS(1,G) then 
a(x, €) = a(w,(x, €) x, W(x, ¢) ¢) 
is uniformly bounded in S(1,G) when OS and belongs to S(1,g) if0<eS 1. 


Proof. When rS1/e we have a,(x, )}=a(x, €) so we may assume in the proof 
that r>1/e. Then we have 


(19.312) (L+|xl/er)~*|DEDL W(x, xs SCp(1 + bl Md +e) 


and an analogous estimate with x and € interchanged. To prove these 
estimates we observe that w, is uniformly bounded in S(1/(1+ er), g). This 
gives the estimate 


Cyl +|x\/er)~ 1 fer) 21 4 r)~ Fl] 4 4 rt et Phy 
S Cyl tr) Pl 1 + xt er) Fh, 


The second term can be omitted if B=0 so this proves (19.3.12). The lemma 
follows at once from (19.3.12) and its analogue with x and é interchanged 
since y,21/2er when er2=1. The proof is complete since a, is homogeneous 
of degree 0 when er21. 


Theorem 19.3.1’. Theorem 19.3.1 remains valid for every aeS(1,G) with values 
in L(C’,C’) such that a~! exists and is bounded outside a compact set. 


Proof. Define a, as in Lemma 19.3.3 and set b,=Wa,' with w chosen as in 
the proof of Theorem 19.3.1. Then the symbols of a?(x, D) b¥(x, D)—I and 
bY (x, D)a”(x,D)—I are bounded in S((1+|x{)"'(1+|é))~',@, so it follows 
from Theorem 19.1.10 in view of Theorem 18.6.6 that aY(x, D) is a Fredholm 
operator with index independent of ¢. Since (19.3.1) is valid for a, when e>0 
and both sides are independent of ¢ when ¢20 it follows that (19.3.1) is also 
valid for ¢=0, which proves the theorem. 


Theorem 19.3.1’ is applicable in particular to the pseudodifferential oper- 
ators in IR" which we encountered in Section 19.2. This is the reason why 
we have restricted ourselves to such a special metric in Theorem 19.3.1. A 
moment’s reflection shows that the proof of Theorem 19.3.1 is directly 
applicable to much more general metrics. It seems plausible that Theorem 
19.3.1 is valid for every o temperate metric such that sup g,.¢/g? g — 0 as 
(x,é)— 00 but no proof seems to have been given in such generality. 

In Theorem 19.3.1 and Theorem 19.3.1’ one can replace a”(x,D) by 
a(x, D). In fact, we have a(x, D)=a'(x, D) where 


a, —aeS((1+|x|)~*(1+16))"7,G). 


If B is sufficiently large it follows that a,=ta,+(1—t)a, OStS1, is a 
homotopy between the maps 0B-»GL(v,C) defined by a and by a,. Thus 
the right-hand side of (19.3.1) does not change if a is replaced by a,, which 
proves the claim. This completes in principle the determination of the index 
of elliptic operators. 
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In geometry elliptic complexes rather than elliptic operators occur naturally. 
To define this notion we assume that we have a compact C® manifold X, 
C*” complex vector bundles Ey, E,,...,Ey on X and polyhomogeneous 
pseudo-differential operators of the same order m 


(19.4.1) 0 C*(X, E,@ Q4)—2+ C*(X, E, @24)—> ... 


N=, C(X, Ey ® Q*)->0 


forming a complex, that is, the compositions D;D;., vanish for j=1,..., 
N—1. The complex is called elliptic if for all (x,é)eT*(X)~0 the principal 
symbol sequence 


do(x, 6) ay - 1{x,€) 


(19.4.2) 0-> Eo, »E 


> Ey, 70 


1x 


is exact, that is, the range of each map is exactly the kernel of the next. The 
basic example is the de Rham complex of differential forms on X with D, 
defined by the exterior differentiation from j forms to j+1 forms. 

After introducing a hermitian structure in each E, we obtain adjoint 
operators 


D*: C°(X,E,, ,@Q*)— C*(X, E,® 0). 
The exactness of (19.4.2) means precisely that for every j 
f + (Dj-1Dj_, + Dj Dp f 
is an elliptic operator on sections of E,@ Q*. The kernel 
(19.4.3) K,;={feD(X, E,@ 0%); D, f=0, D*_, f=0} 
is therefore a finite dimensional subspace of C™(X,E j8 2), and for every 
feH,, orthogonal to K, one can find géH,,, >) with 
(D¥D,+D,_,D*_sg=f 

If D;f=0 then f and D,_,D*_,g are orthogonal to D¥D,g so it follows 
that D#D,g=0 and that f=D,_,u where u=D* ,geH Thus D, 


(s+m)° j-3} 


maps Hi.4m) on the set of all feH(, with D; f=0 orthogonal to K;. Hence 
K, can be identified with the quotient of the kernel of D, by the range of 
D,_, if D; and D,_, operate on C® sections and also if they operate on 
distribution sections. The Euler characteristic 


(19.4.4) Yi (-1) dim K, 
is equal to the index of the elliptic operator 


C*(X, BE, ;@ Q4)3(ug, Ug, ...)h>(Do Uy + Dt uy, ...)ECAUX, @ Ep, @ 2") 
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obtained by splitting the complex, so it can be determined with the methods 
developed in Sections 19.2 and 19.3. 

We shall now show that Theorem 19.1.15 is valid not only for trace class 
operators R, but also for arbitrary R, such that the calculus of wave front 
sets makes it possible to define a trace by restricting the Schwartz kernel to 
the diagonal. 


Theorem 19.4.1, Let (19.4.1) be an elliptic complex and let K, be the finite 
dimensional cohomology spaces Ker D,/Im D,_, isomorphic to (19.4.3). Assume 
given operators 


Rj: C°(X, E;® Q}) > D(X, Ej ® Q}) 
commuting with the complex, that is, assume that 
(19.4.5) R,Djyf=Dj;-1Rjif, j=l,..,N; feC?(X, Ej, @Q}). 


Then R, induces a linear map R;: Kj K,. Assume that the Schwartz kernel 
RED (Xx X, E;RE;@Q} x) has a wave front set disjoint with the conormal 
bundle of the diagonal in X x X, and denote by Tr(R;) the integral over X of 
the trace of the pullback €9'(X, Hom(E,, E))@Qx) of B; by the diagonal 
map X > X x X. Then we have 


(19.4.6) > gy THR)=¥( —1) Tr(Rj|K}). 
0 0 


Proof. Let F; be a parametrix for the “Laplace operator” D;_, D}_,+D* Dj. 
Thus the operators 


(19.4.7)  (D,_,D¥.,+DED)F,-1;,  F(Dj_,Df_,+D#D)-1, 


have C® kernels if I; is the identity operator on sections of E;@Q*. We 
define a parametrix of the complex by 


G, =D? F; 


J+ i 


j=0,...,N—1. 
The essential property is that the operators 
(19.4.8) hj=1,—-D,_,G;_,—-G,D; 


J 
have C® kernels. To prove this we first observe that 


hy=1j;—Dj_1 Df Fy— DF Fi 


D;. 
Now we have 
DD DF, + D? D,)=D,D* D,=(D,D¥+D}, ; D,, 1) D;- 


If we multiply to the left by F,,, and to the right by F, and use that the 
operators (19.4.7) have C® kernels it follows that F,,,D,—Dj,F, has a C™ 
kernel. Since 

hy=1,—(Dj_, Dj_ + DF DJF + DFO Fi — Fj, 1 Dy) 


j-1 


it follows that h, has a C™ kernel as claimed. 
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h, also commutes with the complex, that is, D;h;=h,, , D,, for both sides 
are equal to D,—-D,G,D,. If feKerD, then h,f=f—D,_,D7¥_,F, so h, 
induces the identity in the quotient spaces K;. Hence h,;R,h, induces the 
map R, there, and h,R,h, is a continuous map from J to C™ so it is a trace 


class operator. By Theorem 19.1.15 we have therefore proved that 

¥(-1/ Tr(R,|K)=>(— 1) Tr(h, R; A). 
Now h,R,h,;—R, is a sum of a number of terms each containing at least one 
factor F, or F,,,. We shall make these terms disappear by choosing a 
sequence of parametrices converging to 0 in a suitable sense. This requires a 
brief review of properties of the wave front set established in Section 8.2 and 
the construction of parametrices given in Section 18.1. 


First recall from Theorem 8.1.9 (see also Theorem 18.1.16) that if 
aeS”™(IR" x R”) and is the kernel of a(x, D) then 


WF(./)< N(A)= {(x, x, €&, —€); xeIR", €ER"~ 0}. 


If a’ is a bounded sequence in S$” which converges to 0 in the C™ topology 
then the kernel of a’(x,D) converges to 0 in 9’, and it follows from the 
proof of Theorem 8.1.9 and the arguments following Definition 8.2.2 that it 
tends to 0 in By,,(R" x R"). For example, if ye Co (IR") and y=1 in the unit 
ball, then a’(x, €)=(1 —y(&/v)) a(x, &) is a sequence to which this applies, and 
a’(x, D)— a(x, D)eOpS~” has a C® kernel. Writing a parametrix T in the 
form >) $, T,W;, where ¢,,, have support in a local coordinate patch and 
T, is a pseudo-differential operator in the local coordinates we can apply 
this observation to each term. Thus we can choose sequences F of para- 
metrices above with kernels converging to 0 in Dy,,), and this implies that 
the kernel of R;—h} R,h} converges to 0 in Dy(X x X) if M=WF(R)). Here 
we have used Theorem 8.2.14. But then it follows from Theorem 8.2.4 that 
the pullback by the diagonal map converges to 0 in Y’(X), which completes 
the proof. 


We shall now discuss the classical Lefschetz fixed point formula as an 
example of Theorem 19.4.1. Thus we consider the de Rham complex 


0 C°(X, A) C°(X, A!)—44 C*(X, A2)-4+... 


where A/ are the exterior powers of the complexification of T*(X) and d the 
exterior differential operator. Then the spaces K, are the de Rham cohomol- 
ogy groups, that is, the cohomology groups over C of the compact manifold 
X. If ¢: X +X is a C® map, then the pullback of forms defines maps 


G,: C%(X, A) > C*(X, A) 
commuting with the complex since d¢*f=* df for every differential form 


J. The Lefschetz number L(@) of ¢ is the alternating sum of the traces of the 
maps induced by ¢* in the de Rham cohomology spaces. 
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Now the kernel of ®, is a simple layer on the graph {(x, 6(x))eX x X} of 
g. (See Example 5.2.5.) By Example 8.2.5 the wave front set is the conorma] 


bundl 
age {(x, 6(x), —''(x) n,n); xEX, NET Fy} 


of the graph which intersects the conormal bundle of the diagonal only 
when x=(x) and ‘d'(x)y=y. Thus the wave front set and the conormal 
bundle of the diagonal are disjoint if and only if 


(19.4.9) (x) =x => det(I —#'(x))+0. 


The fixed point x is then said to be non-degenerate. (Here ¢’(x): 
T,{X)— T,(X), and I is the identity map in T,(X) so the determinant is well 
defined.) By the inverse function theorem non-degenerate fixed points are 
isolated so if all fixed points of ¢ are non-degenerate there are only a finite 
number of them. Let us choose local coordinates x,,...,x, vanishing at 
a non-degenerate fixed point. In a neighborhood the kernel of ®, is then 
6(y—(x)) and the kernel of ®, is A(x) 6(y—(x)) where Ax)= A! *’(x) is 
the linear transformation in A’ C" defined by composition of a j form with 
the differential ¢'(x). The alternating sum of the traces of the pullbacks by 
the map x(x, x) is 

(Y(-1) Tr A,(x)) d(x — G(x) = det (I — '(x))/ldet (I — $'(x))] 6(x). 
Here we have used Example 6.1.3 and made a standard expansion of 
det(I —‘d'(x)). Theorem 19.4.1 now reduces to the Lefschetz fixed point 
formula: If all fixed points of @ are non-degenerate then the Lefschetz 
number L(q) is 
(19.4.10) Lip)= > sgndet(I—$'(x)), 

¢ 


(x)= x 


that is, the number of fixed points counted with signs. 
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In order not to complicate the exposition we did not discuss in Section 19.2 
whether ellipticity is necessary for the Fredholm properties. We shall now 
show that this is true in some sense. By X we denote a compact C™ 
manifold and by E, F two C® complex vector bundles on X not necessarily 
of the same fiber dimension. 


Theorem 19.5.1. If Pe¥"(X; EQ@Q?, F@Q*) then the following conditions 
are equivalent: 

(i) The operator P: H.,(X; E@2*)> Hy,_.m(X;3 F @Q*) has closed range 
and finite dimensional kernel for every s. 

(ii) Condition (i) is fulfilled for some s. 
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(iti) The principal symbol p has a left inverse qeS~"(X; Hom(x* F, x* E)), 
that is, qp—ideS~3(X; Hom(x* E, x* E)). 

(iv) The operator P has a left parametrix Qe¥-"(X;F@Q*, EQN), 
that is, QP=I+R where R is an operator with kernel in C°(Hom(E, E) 
@ 4, x)- 


Proof. (i)=(ii) is obvious. If (ii) is fulfilled then we can find a constant C 
such that 


(19.5.1) Hells) S CUP ll cs my + Uell(s— ay)» ueH,,(X; E@Q?). 
Indeed, otherwise we could find a sequence u, with ||u,||,.=1 and 
|Pujll(s—m <1, I] jllis—1)< jj. 

By Proposition 19.1.3 it follows that u,; has a subsequence converging in 
Hyg (X ; E®@*) which is absurd since the limit must have norm 1 and be 
equal to 0. It is no restriction to assume that s=m for otherwise we can 
multiply P to the left by an elliptic operator of order s—m. Let Y be a local 
coordinate patch with local coordinates x,,...,x,, where E and F are 


identified with Yx C" and Yx C%” respectively. Choose weC?(Y) equal to 
1 in a large open subset Z, and set with Pe C%X(Z), €eIR" and we€”, |w|=1, 


u(x)=o(x) wei. 
Then (1—yW)Pu, is rapidly decreasing when ¢->co since the kernel of 
(l-w)P¢ is in C*®. We have wPg@u=a(x,D)u in terms of the local 
coordinates, and a(x, )— p(x,¢)peS"~*. Applying (19.5.1) to u, we obtain, 
since p Pu,=a(x, d)wel®, 
(1+(E) loll SC’ lp, dw P(x) + Cyd +16) 
(see the proof of Theorem 19.2.5). Here || || is the 7 norm. This implies that 


lim inf inf p(x, €)wi/ + 1 E)" 2 1/C’. 


Paes lw} =1 xe 


In fact, assume that there are sequences w, with |w,|=1, x,eZ, €,--00 such 
that |p(x,,¢,)w|/1+1é,)"S C,<1/C’. Passing to a subsequence we may 
assume that w,- w, x, X 9. Then 


Ip &) wh <3(Co+1/C) (1 +161)" 


for all x in a neighborhood U of x, independent of v. When @ECP(ZNU) 
we obtain a contradiction since s=m. Thus 


(p*(x, 6) p(x, dw, w)2(1+1E))"Iwi7/2C%,  xeZ, 


if |¢| is sufficiently large. The determinant ¢S?”"’ of p*(x, €) p(x, © is there- 
fore bounded from below by (14+{é|)?"%/(2C’7)"’ so it has an inverse in 
S-?"" by Theorem 18.1.9. The product q of this inverse by the cofactor 
matrix of p*p and with p* is a left inverse of p belonging to S~™ over Z. 
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Piecing together such local constructions of q by a partition of unity in X 
we obtain (iii). If (iii) holds and Q, has principal symbol q then Q, P= 
I+R, where R,e¥~*. Thus [+R, has a parametrix, and (iv) is valid for 
the product of such a parametrix and Q,. By Proposition 19.1.3 it follows 
at once from (iv) that (i) is valid, which completes the proof. 


There is also an essentially dual result: 


Theorem 19.5.2. [f Pe ¥"(X ; E @ Q3, F @ Q*) then the following conditions are 
equivalent: 
(i) The range of the operator P: H,.(X; E@Q*)> Hyg m(X3 F @ Q*) has 

finite codimension for every s. 

(ii) Condition (i) is fulfilled for some s. 

(iit) The principal symbol p has aé_e “right inverse géeS~™(X; 
Hom(x* F,x* E)), that is, pq—ideS~'(X; Hom(x* F, x* F)). 

(iv) The operator P has a right parametrix Qe¥~™(X; F @Q?, E@M), 
that is, PQ=I+R where R is an operator with kernel in C°(Hom(F, F) 
@ 23 x x): 


Proof. That (i)=>(ii) is trivial. By Lemma 19.1.1 condition (ii) implies that the 
range is closed, hence 
P*: Hoy 9(X3 F*@ 24) H_(X; E*¥ @ 2%) 

has closed range and finite dimensional kernel. By Theorem 19.5.1 it follows 
that p* has a left inverse g*eS~™, and qgeS~™ is then a right inverse of p. 
That (iii)=(iv) follows just as in the proof of Theorem 19.5.1 (see also the 
proof of Theorem 18.1.9), and (iv)=(i) is clear since I+R is a Fredholm 
operator. The proof is complete. 


Remark. When P is polyhomogeneous the conditions in Theorems 19.5.1 
and 19.5.2 are of course equivalent to injectivity and Surjectivity of the 
principal symbol. 


Theorems " 5.1 and 19.5.2 combined show that P defines a Fredholm 
operator H.,— H,,_,,) if and only if P is elliptic. This justifies the hypotheses 
made in Section 19.2. However, it should be understood that the class of 
Fredholm operators changes if the norms are changed. A particularly simple 
case of importance in Chapter XX occurs when we have decompositions 


K J 
E=@F, F=@F, 


A section u of E®Q? can then be written in the form (u,,...,u) where u, is 
a section of E,@?, and sections of F@Q* can be written similarly as 
(f1,--.f)). If PeV°(X; E@Q?, F@Q*) the equation Pu=f can then be 
written 


K 
Y PaaS j=l,...J, 
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where P,eP°(X; E,@ Q?, F,@Q*). Let s,,...,s, and t,,...,t, be real num- 
bers. Then P defines a continuous operator 


(19.5.2) P: OH, (X; E,® 22) OH (X; F@ 2?) 


if Pe?" ~Si(X; E,,@ Q+, F,®Q?). 


Theorem 19.5.3. If P,eP™—*(X; E,, @ 2%, F,® 0?) then P=(P,) defines a 
continuous operator (19.5.2), and the following conditions are equivalent: 

(i) (19.5.2) is a Fredholm operator. 

(ii) One can find q,,¢S*~™(X;Hom(x* F,,n* E,)) such that (p)(q)—id 
and (q)(p)—id are in S~! if Dj, is the principal symbol of P,,. 

(ili) One can find 0,;¢ 8 1 —™ (X; F,@ 03, E, @2*) such that for the cor- 
responding operator Q the kernels of QP —id and PQ —id are in C™. 


Proof. Let A,e¥ ~*(X; E,@Q+, E,@Q*) and Be P © (X; F,@ Q#, F,@Q*) be 
elliptic operators. Then the diagonal operators A=(A,6,;) and B=(B,6;)) 
are Fredholm operators, 


A: Hoy (X; E@Q*) OH) (X; E,,@Q%), 
B: OH, (X; F,@Q*)> Ho (x; F@Q?). 


If (i) is fulfilled it follows that BPA is a Fredholm operator in I’, hence 
elliptic by Theorems 19.5.1 and 19.5.2. If b; and a, are the principal symbols 
of B, and A, it follows that (b;p,,a,) has an inverse (r,;)€S°, and (ii) is then 
satisfied by q,,;=a,7j;b;. The proof that (ii)=-(iii) is just like the proof of (iii) 
=>(iv) in Theorem 19.5.1, and (iii)=(i) by Corollary 19.1.9. The proof is 
complete. 


An operator satisfying the conditions in Theorem 19.5.3 is said to be 
elliptic in the sense of Douglis and Nirenberg, and (p,,) is called its prin- 
cipal symbol. It is clear that all we have done for standard elliptic operators 
can be extended to such operators by the simple device of left and right 
multiplication by appropriate diagonal matrices as in the proof of Theorem 
19.5.3, This is also true for the index if we take the principal symbols of A, 
and B, as positive multiples of the identity. It is clear that these operators 
have index 0 and we still obtain ind P=s—ind p. As a rule we shall there- 
fore content ourselves with discussing only the standard elliptic case and 
leave the extension to Douglis-Nirenberg systems for the reader. Thus the 
reader is now asked to state and prove analogues of Theorems 19.5.1 and 
19.5.2 for such systems. 

Even in the scalar case there exist Fredholm operators defined by non- 
elliptic pseudo-differential operators provided that the spaces H,, are suit- 
ably modified. Examples are hypoelliptic operators of constant strength (cf. 
Theorem 13.4.1) or more generally the operators studied in Section 22.1. 
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The first result on the index of pseudo-differential (or rather singular in- 
tegral) operators seems to have been proved by Noether [1] to whom the 
example discussed after Theorem 19.2.4 is due. The extension of Fredholm 
theory from compact perturbations of the identity to compact perturbations 
of arbitrary “Fredholm operators” given in Atkinson [1] made it plain that 
the index was an interesting homotopy invariant of elliptic operators. It was 
determined for various special classes of elliptic operators by a number of 
authors at the end of the 1950’s. However, the most natural and important 
examples arise in complex analysis and algebraic geometry, in the Riemann- 
Roch theorem (in higher dimensions) and the Hirzebruch index theorem. 
(See Hirzebruch [1].) These results suggested an expression for the index in 
terms of characteristic classes, and such a formula was found by Atiyah and 
Singer [1]. We have not given their explicit formula here but just the 
characteristic properties of the index established in part I of Atiyah-Singer 
[2]. (Their discussion of the group invariant situation has been omitted 
here.) For the purely geometric derivation of the index formula from these 
results we refer to part III of Atiyah-Singer [2]. The formula has the 
following character. If X is a compact C® manifold then the principal 
symbol of an elliptic operator P defines a Chern character which is a 
cohomology class with compact support in T*(X). In the manifold X there 
is a cohomology class .4(X), independent of P, defined in terms of the 
Pontrjagin classes. Then ind P is the integral over T*(X) of the product of 
the Chern character and .4(X), lifted to T*(X). To verify it from the results 
proved here one must first show that the formula agrees with (19.3.1) for 
operators in IR” which are trivial at infinity. This is easy because ¥(X)=1 
then. One must further show that the product construction in Theorem 
19.2.13 agrees with the formula at least when V is the normal bundle of an 
embedding in Euclidean space. It is here that #%(X) appears as the integral 
of the factor of the Chern character coming from the Bott operator. 

The indicated path to the index formula differs from that of Atiyah- 
Singer [2] only by relying on Theorem 19.3.1 and the construction of the 
Bott operator in Section 19.2 instead of the Bott periodicity theorem and 
some simpler normalization conditions. The proof of Theorem 19.3.1 is due 
to Fedosov [1], who has later on discussed more general manifolds in the 
same way. Richard Melrose has informed the author that in unpublished 
work he has given a proof of the full index formula for operators between 
trivial bundles by arguments of Fedosov’s type based on a careful cor- 
respondence between symbols and operators on a Riemannian manifold. 
Thus his arguments give the quite complicated index class %(X). For anoth- 
er derivation of the index formula we refer to Atiyah-Bott-Patodi [1]. 

The Lefschetz formula of Section 19.4 is essentially due to Atiyah-Bott 
[2]. They only considered operators R, related to composition with a 
diffeomorphism in X with non-degenerate fixed points but now that the 
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notion of wave front set is available Theorem 19.4.1 seems the natural 
setting for their arguments. The reader should consult Atiyah-Bott [2] for 
less classical applications than the one given here. 

Section 19.1 is largely a collection of classical results in Fredholm 
theory. We have added some strengthened results on the stability of the 
index taken from Hormander [42]. (The reader consulting this reference is 
warned that misprints are abundant since the article was never proofread.) 
The Bott operator was also discussed there. The simpler treatment of it here 
is largely taken from H6rmander [39] and relies on later progress in the 
theory of pseudo-differential operators. We also refer to Hérmander [42] 
for a proof that the index formula remains valid for fairly large classes of 
hypoelliptic operators which will be studied from a local point of view in 
Section 22.1. They include the elliptic operators in the sense of Douglis and 
Nirenberg [1] introduced in Section 19.5. Lack of space has made it 
impossible to cover the closely related results concerning Toeplitz operators. 
The reader is referred to Boutet de Monvel [3] for a discussion including an 
index theorem. 


Chapter XX. Boundary Problems 
for Elliptic Differential Operators 


Summary 


In Chapter XVII we discussed some of the classical methods for solving the 
Dirichlet problem for second order elliptic differential operators. The pur- 
pose of this chapter is to use the theory of pseudo-differential operators to 
study general boundary problems for elliptic systems of differential oper- 
ators. (See also the end of Section 17.3.) 

To illustrate the main idea in a simple case let us assume that we want 
to solve the boundary problem 


Au=0 in X, bouotb,u,=f on éX, 


where X is an open set in R” with smooth boundary 0X, A is the La- 
placean, and bo, b, are differential operators in 0X acting respectively on the 
boundary value u, and the normal derivative u,=du/dn of u. If E is the 
fundamental solution of the Laplacean (see Theorem 3.3.2) and u is smooth, 
we obtain from Green’s formula 


(20.1) u(x)= J dE(x — y)/dnug(y) dS(y)— J E(x — y) u(y) dS(y), 


if xeX. Thus it suffices to determine u, and u,. However, although (20.1) 
always defines a harmonic function u it may not have the boundary value 
and normal derivative u),u,. To examine if that is true we let x approach 
0X and obtain 


(20.2) Uy =ko ug tk, uy, 


where k, and k, are pseudo-differential operators by Theorem 18.2.17, for 
convolution by E is a pseudo-differential operator satisfying the trans- 
mission condition. Conversely, (20.2) implies if n>2 that there is a harmonic 
function u with u, and u, as boundary value and normal derivative. For if u 
is defined by (20.1) it follows from (20.2) that u=u,y on 0X. If we subtract 
Green’s formula for u from (20.1) we obtain 


J E(x—y)(u,(y)—du/dn)dS(y)=0,  xeX. 


The integral is a continuous function of x which is harmonic outside X 
and vanishes on @X and at infinity. Hence it is identically 0 which implies 
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that u,=du/dn. To solve the boundary problem is thus equivalent to solv- 
ing the system of pseudo-differential equations 


(L—ko)ug—k,u,=0, byuygtb,u,=f 


Here k, is an injective elliptic pseudo-differential operator, and it is easily 
seen that it has index 0, hence a pseudo-differential inverse kj‘. Thus the 
Dirichlet problem is solved by taking u,=ky'(1—ky)uo. In general we 
obtain pseudo-differential equation 


(bo +b, ky* —ko)ug=f 


When this is an elliptic pseudo-differential operator the boundary problem 
is called elliptic and then we can just apply the results of Chapter XIX. 

To develop the preceding ideas in general we must examine the anal- 
ogues of the operators k, and k, for general elliptic systems of differential 
equations in X. This requires only minor additions to the results proved in 
Section 18.2. They are given in Section 20.1 where we establish the Fred- 
holm properties of general elliptic boundary problems. 

Next we turn to the question of the index of an elliptic boundary 
problem. As a preliminary we discuss in Section 20.2 some elementary but 
important facts on ordinary systems of differential operators with constant 
coefficients. These are used in Section 20.3 to deform an elliptic boundary 
problem to another one with the same index which is just one half of the 
index of a natural extension of the operator to the double of the manifold, 
which no longer has a boundary. As in Chapter XIX we do not give an 
explicit formula for the index but only establish a procedure for obtaining 
it. 

In Section 20.4, finally, we give some remarks on non-elliptic boundary 
problems. These are meant to motivate why we devote so much effort to 
existence theory for non-elliptic pseudo-differential operators in the sub- 
sequent chapters. 


20.1. Elliptic Boundary Problems 


Let X be a compact C®” manifold with boundary 0X. Assume given an 
elliptic differential operator P of order m from C™(X, E) to C™(X, F) where 
E and F are complex C” vector bundles on X with the same fiber dimen- 
sion N. (We do not include half density bundles here since they would just 
complicate notation.) Thus the principal symbol p(x, €) is a bijection E,—F, 
for every €€7,*~ {0}. To set up a boundary problem we need in addition 
boundary differential operators B;: C*(X, E)> C™(0X,G,), j=1,...,J, where 
G, are C® vector bundles on 0X. (This notion is discussed in the scalar case 
in Appendix B.2. The presence of bundles makes no difference locally and 
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should be ignored by the reader who does not feel familiar with bundles.) 
The boundary problem we shail study is to find ueC™(X, E) satisfying 


(20.1.1) Pu=f in X; Bju=g,; in 0X, j=1,...,J; 


where feC™(X,F) and g,e¢C™(éX,G,) are given. Later on we shall state the 
problem under weaker smoothness assumptions also. 

In a local coordinate patch X, such that X,, is defined by x,20 and E, 
F are identified with X, x C%, the equation Pu=f has the form 


VAX) Dtu=f 
where the coefficients P are C® functions of x with values in NxN 
matrices, and the coefficient of D™ is invertible since P is elliptic. Thus we 


can write 
Dru=Pol (f— Re) D*w). 
If the transversal order of B, exceeds m—1, that is, B, contains terms 
B,.(x)D*u with «,2m, then we can replace Di'u by this expression in all 
terms with «,2m to reduce the transversal order. Repeating the argument 
and using a partition of unity to combine the local constructions we 
conclude that 
B,=Bi+C,P 


where Bi has transversal order <m and C;, is a boundary differential 
operator from C™(F) to C®(G,). Then (20.1.1) is equivalent to 


Pu=f in X; Bjyu=g,-C,f in 0X, j=1,...,J. 


It is therefore no restriction to assume that the transversal order of B, is less 
than m, as we shall do from now on. However, we shall put no restriction 
on the total order m, of B,. 

For any s2=m we can extend P from C® to a continuous map 


P: A (X°, E)> A _m(X°, F); 
here X° is the interior of X and the spaces are defined in Appendix B.2. By 
Theorem B.2.10 we also have a continuous extension 


B,: H(X°, E) +H \(6X,G,) 


(s—mj—-4 


since the transversal order of B, is Sm—1. We want to examine when the 
operator , 


(20.1.2) H,(X°, E)sut>(Pu, B,u,...,B ue 
Fi s_m(X°: EV\@H (0X,G,)®... BA. my—4)(OX, Gj) 
is a Fredholm operator. 


(s—m;—4) 


Definition 20.1.1. The boundary problem (20.1.1) is called elliptic if 
(i) P(x, D) is elliptic. 
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(ii) The boundary conditions are elliptic in the sense that for every 
xedX and €€T*(X) not proportional to the interior conormal n, of X the 
map 


My .dur>(b, (x, E+ D,n,) u(0), ...,bs(x, 6 + D,n,) u(O))EG,, @...OG,, 


is bijective, if M*. is the set of all ueC™(R,E,) with p(x,¢+D,n,)u(t)=0 
which are bounded on R,. 


Here b, is of course the principal symbol of B, and p is that of P. In the 
definition it suffices to take one € in each equivalence class mod Rn, since 
translation by sn, is equivalent to conjugation by e's. By condition (i) we 
know that p(x, é+n,) is non-singular when 7 is real, for €+1n,,+0 then by 
assumption, so all elements of M;. are in fact exponentially decreasing on 
R_,. (See also Section 20.2 for a discussion of systems of ordinary differential 
operators.) We have chosen the term elliptic since we shall see that the 
definition is equivalent to ellipticity of an associated system of pseudo- 
differential operators. However, the literature contains many other terms 
such as the Lopatinski-Schapiro condition, coerciveness and the covering 
conditions. The main result of this section is the following generalisation of 
the study of the Dirichlet problem in Section 17.3. 


Theorem 20.1.2. If the boundary problem (20.1.1) is elliptic and s2m, then 
(20.1.2) is a Fredholm operator. 


The proof will be given after some preparations which allow a reduction 
along the lines indicated in the introduction to a system of pseudo-differen- 
tial operators in @X. It is convenient to identify a neighborhood X, of 0X 
in X with 0X x (0,1) for example by introducing a Riemannian metric in X 
and taking geodesic normal coordinates with respect to dX (see Appendix 
C.5). We can also identify the bundles E and F restricted to X, with their 
liftings by the projection 2,:X,—70X, for any bundle map En E, for 
example, which is an isomorphism on @X must be an isomorphism on 
6X x [0,¢) for small «. A change of scales will replace X, by éX x [0,¢). In 
what follows we denote points in X, by (x’,x,) where x’e0X and x,e[0, 1). 
Since E,=E,,. 9) the normal derivatives D/u of sections u of E are then well 
defined. We can also extend X to an open manifold X = XU(éX x(-—1,0)) 
with the obvious C® structure. The bundles E and F can be extended to X¥ 
so that the fibers are independent of x, in OK x (—1,1). We can also extend 
the differential operator P to X. Shrinking X, again if necessary we may 
assume that P is elliptic on X too. By T we denote a properly supported 
parametrix of P on X. Thus Te ¥~"(X; F, E) and 


(20.1.3) TP=I,+R,, PT=I;+Rp, 


where I,, I, denote the identity operators on sections of E and F, and R,, 
R, are operators with C™ kernels on such sections. 
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If we C®(X, E) we write 
yu=(u,D,u,...,D™-* u)|,y¢C™ (OX, E™) 
for the Cauchy data. In 0X x [0,1) we have 
P= » P.D} 


where P, is a differential operator of order m—j in 0X depending on the 
parameter. x, We denote the principal symbol by p,. With u°=u in X and 
u°=0 in XXX, we obtain in X 


(20.1.4) Pu®=(Pu)? +P’ yu 

where for U=(Up,..., U,,_ JE C°(EX, E”) 

(20.1.5) PU=i' YP, dX U;_,@ Dko, 
j<m ks 


6 denoting the Dirac measure in x, In fact, induction with respect to j gives 
j 
Dit* yu =(Dit* uy? +i Y y;_,u@ Dks. 
0 


If T is applied to (20.1.4) we obtain “Green’s formula” 

(20.1.6) u°+R,u° = T(Pu) + TP’ yu, 

in view of (20.1.3). Note that if Pu=0 in X and R,=0 it follows that yu 
=yTP* yu, where the boundary values on the right exist by Theorem 18.2.17 


since every term in the symbol of T is a rational function. For 
UeC™(éX, E”) we therefore introduce 


(20.1.7) , QU=yTP* U; 
thus QU is the Cauchy data of the Green potential TP* U. Explicitly, 


(QU), = 5 011 YU, Db k=0,...,m—-1, 


where 
-i- 


Q,, U,= y i~* Dy TP, 414, U,®Didlox, 


the boundary values being taken from X of course. Thus Q,, is a pseudo- 
differential operator in 0X of order k—1[ with principal symbol 


m—I-1 
(20.1.8) gulx’, =m f* YS pO, * ayy 1410, 6) dE, 
j=0 
that is, the sum of the residues of the integrand in the half plane Im é,>0. 


This means that (Q,,) is a Douglis-Nirenberg system as discussed in Section 
19.5. 
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Theorem 20.1.3. The system Q of pseudo-differential operators defined by 
(20.1.7) is an approximate projection in the space of Cauchy data in the sense 
that Q?—Q has a C™ kernel. If we identify solutions of the ordinary differen- 
tial equation p(x',0, &’, D,) v=0 with the Cauchy data (v(0),..., D"™~' v(0)) then 
the principal symbol q(x’, é') is for &'+0 identified with the projection on the 
subspace M* of solutions exponentially decreasing on R, along the subspace 
M~ of solutions exponentially decreasing on R_. One calls Q, and sometimes 
also q, the Calderon projector. 


Proof. If u= TP‘ U then the restriction of u to X is in C® and by (20.1.3) 
Pu=([,+R,;)P°-U=R,P°U_ in X° 


where R, has a C™ kernel. By definition yu=QU, so if we apply (20.1.6) to 
u and take the Cauchy data from X, it follows that 


QU+yR,(TP* U)’ =yT(Rp P* UY)? +Q? U. 


Now the map Ut>R,P°U is continuous from 9'(dX, E”) to C%(X, F), and 
since T satisfies the transmission condition it follows that the map 
UH+yT(R,; PU) is continuous from J to C®, hence has a C™ kernel. If 
pe C™(X, E* @Q,) and UeC™, then approximation of @ by functions van- 
ishing near 0X gives 
((TP* U)°, 6) =(P° U, T* 6°). 

(Note that T*°¢C™"~'(X,E).) Since T* also satisfies the transmission 
condition it follows that T* #°<C™(X), which proves that Ur+(TP* U)° can 
be extended to a continuous map from 9'(dX,E") to Y(X,E). Hence 
Ut>yR,(TP* U)° is also continuous from 9’ to C” which proves that Q? 
—Q has a C™ kernel. 

To interpret the principal symbol we observe that if U=(Up,..., 
U,, JE EC, 9) then the inverse Fourier transform 

v(x,)=(2%i)—* J p(X,0,E)°* DY yr ra re, 0,0) 4 Ue dE, 


jtl<m 


is in #’ and satisfies the equation 


p(x',0,,D,)v=i7? Yo yyigr(',0,6) U, Di 6. 


j+l<m 


Thus v concides for x,>0 with an element v*¢M* and when x,<0 with an 
element v-e€M~. For x,=0 we have the jump conditions 


Dk(v+ —v-)=U,, k=0,...,m—1. 
The equations (20.1.8) mean that 
q(x’, €') U=(v* (0), ..., Dr! vt 0). 


Hence qU=U means that v~ =0, that is, that U is the Cauchy data of a 
solution in M*. The equation qU=0 means that vt =0, that is, that U is 
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the Cauchy data of a solution in M~. Hence q is the projection on M* 
along the complementary space M~; that q is a projection is of course also 
a consequence of the fact that the symbol of Q?—@Q is of order — co. The 
proof is complete. 


Corollary 20.1.4. The map 
C”(X, F)afreQyTf°eC*(éX, E”) 


can be extended to a continuous map from Ho(X°, F) to C*(éx, E”). 


Proof. If feC™(X,F) then u=Tf|,e C*(X,E) since T satisfies the trans- 
mission condition. It follows from (20.1.3) that Pu=f+R,f° in X. Thus we 
obtain from (20.1.6) 


yut+yRyuo=yTfo+yT(Re f°) +Oyu. 


The map f + Rrf° is continuous from Ho to C~, sof + yT(Rr f°) is 
continuous from Ho, to C%, again by the transmission condition. Now 
f > Tf°lx is continuous from A) to Aymy(X°), so f + yReu° is con- 
tinuous from Ho, to C~. By Theorem B.2.10 the map/fteyu is continuous 
from Hjo(X°, F) to MA, ;-4(EX, E), and since Q—Q? is continuous from 


I-72 


OH im j— (0X, E) to O®C*(OX, E), the corollary is proved. 


By hypothesis the operators B, in (20.1.1) have transversal order <m, so they 
can be written in the form B;=B‘y, where B* is a differential operator from 
sections of E™ on 0X to sections of G; on 0X. If U=yu it follows from 
(20.1.6) that U—QU is essentially determined by {=Pu, so we are led to 
replace (20.1.1) by equations of the form 


U=~+QU; BoU=g,, j=t,...,J. 


At first sight it might seem that there are too many equations here. For 
example, the Laplace equation with Dirichlet boundary conditions leads to 
3 equations for 2 unknowns. However, yw is not arbitrary since Qyw is 
essentially 0 by Corollary 20.1.4. Since Q is a projection this will effectively 
decrease the number of equations to be solved as shown by the following 
proposition which for the sake of simplicity is stated for standard elliptic 
operators rather than those of the Douglis-Nirenberg type. 


Proposition 20.1.5. Let H and G be C® complex vector bundles on a compact 
manifold Y without boundary, let Qe Pel YS HH), Be'Py..(Y; H,G) and as- 
sume that Q?-Q=0, that is, that Q?-Qe¥~™®. Denote the principal symbols 
by b and by q. Then 

(i) if b(y,n) restricted to q(y,n)H,, is surjective for all (y,n)eT*(Y)~ 0, then 
one can find Se ¥5"(Y; G, H) such that 


phe 


(20.1.9) BS=I,, QS=S. 
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(ii) if b(y,n) restricted to q(y,n) H, is injective for all (y,n)eT*(Y)~0 then 
one can find S'e,,#(Y; G, H) and S"e¥),,(Y; H, H) such that 


(20.1.10) S'B+S"=I,, S”Q=0. 


(iti) if b(y,n) restricted to q(y,n) H, is bijective, then S, S’, S” are uniquely 
determined mod ¥-®, and S’=S. 


Proof. In case (i) we can by the polyhomogeneous form of Theorem 19.5.2 
applied to BQ find TeS34(Y; G,H) such that BOT=I,. Then S=QT satis- 
fies (20.1.9), for 


BS=BQT=I,, @QS=(Q*—-Q)T+QT=OQTS=S. 
In case (ii) we can use the polyhomogeneous case of Theorem 19.5.1 for the 
Douglis-Nirenberg system B@(Iy—Q) to find T’ePi*(Y;G,H) and 
T"e P%,,(¥; H, H) such that 
T B+T"Uy—Q)=Iy. 


Then S’=T’ and S”=T"(I,,—Q) satisfy (20.1.10) since S’Q=T’(Q—Q’)=0. 
In case (ili) we choose S, S’ and S” so that (20.1.9) and (20.1.10) hold. Then 
it follows that 


S=S' BS=S’ BS+S"(S—QS)=S'BS+S"S=S. 


Hence S=S' so these operators and therefore S” are uniquely determined 
mod ¥~. This completes the proof. 


In the situation at hand Q=(Q,,) where Q,,¢¥4,'(0X; E,E), and B is 
replaced by the matrix (B%,) defined by 


phg 


m-1i 
BiU= Bi U, Bye me “OX; E.G) 


By the arguments used to prove Theorem 19.5.3 it follows from Proposition 
20.1.5, if the boundary problem is elliptic, that we can find operators 


Syje VEG (OX5G;,E);  k=0,...,m—1; j=1,...,J5 


phg 


Sie Pao (OX ; E, E); k,l=0,...,.m—1; 


phg 


such that (20.1.9), (20.1.10) are fulfilled for S’=S=(S,,) and S”=(S;). We can 
use these operators to construct a parametrix of the boundary problem. 

First assume that ueC*(X,E) and that (20.1.1) holds. With U=yu it 
follows from (20.1.6) that 


U+yR,u°=yTf°+Qu, 


and the boundary conditions (20.1.1) give BSU=g,. By (20.1.10) we have 
with the obvious block matrix notation 


SB°+S"(I—-Q)=I-R 
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where Re “(6X; E”, E”). Hence 

_ U=Sg+S"(yTf°—yR,u°)+RU. 
Using (20.1.6) again we obtain 
(20.1.11) u=(I+TP°S” y) Tf°+TP*Sg+Ku, 


where 
Ku=TP(Ryu—S" yR,u°)—R,u°® 


is a continuous map from H,,,(X°, E) to C°(X,E). Thus the map 
(20.1.12) (fg) (+ TPS”) Tf + TPSg 


is an approximate left inverse of (20.1.2). It is also an approximate right 
inverse. In fact, with the notation u for the right-hand side of (20.1.12) we 
have by (20.1.3) 


Pu=([p+R,p)(I+P°S" yT)f°+PSg) 
=f°+R,f°+R, PS’ yTf° +R, P Sg 
where the second equality is valid in X°. Hence 
(20.1.13) P(I-+ TP‘ Sy) Tf° + TP Sg)=f+K,f+Kog 


where K, is a continuous map from Ho)(X°, F) to C°(X,F) and K, is a 
continuous map from 9'(¢X,@G,) to C*(X, F). We have 


Be yu= B(I+QS")yTf° +B OS. 
Here K,=B‘QS—I is of order — o by (20.1.9), and (20.1.9), (20.1.10) give 
B°(I+QS8”")= B(I+ Q — QSB‘)= B‘ + BS Q — B°QSB‘ = BQ. 
In view of Corollary 20.1.4 it follows that 
(20.1.14) B(I+ TP*S” y) Tf°+ TP Sg)=g+K3f+K4g 


where K, is a continuous map from Ho)(X°. F) to C°(X,@G,) and K, is 
a map in C*(X,@G,) with C® kernel. Theorem 20.1.2 will follow from 
Corollary 19.1.9 and (20.1.11), (20.1.13), (20.1.14) when we have verified some 
continuity properties of (20.1.12). 

__ In the following proposition the notation ||u||,,, stands for the norm in 
H(X°) if weC™(X, E), and similarly for sections of F in X°. For the sake 
of clarity we shall write Tv for the restriction of Tv to X°. 


Proposition 20.1.6. If s=m and feC™(X, F) then 
(20.1.15) VTS ly S CUS Mem 
If U=(Up,...,U,,_ )eC*(6X, E”) we have for any s 


m—1 
(20.1.16) (TP? UlgoSC Y Wyle ja 
it) 
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Proof. \t suffices to prove (20.1.15) when f has support in a compact subset 
K of a local coordinate patch Yx [0,1) at the boundary, YcIR"~‘. Choose 
be CH(Yx [0, 1)) equal to 1 in a neighborhood of K, and let k be an integer 
>s—m. We shall first estimate |p Tf hepeatiast with the notation in Appendix 
B.2. To do so we note that 


WF Nicene = IF Weo,s— m= IF llo,s-m = WF les my: 
Here f° is considered as a distribution in IR". We can write 
poT= > TD? 
18| Sle} 
where 7, is a pseudo-differential operator of order —m and f,,=0 if «,=0. 
Since ||D* f° II. m— wy ISOliis_m—z.w Hf |BISk and B,=0, it follows that 
ID" p TH lisp S CUS Ms m—koa 


if x, =0 and |o|<k. This gives an estimate of ||¢ TE aes so 


OTE Ne ap Cl less 


Now PTf°=f+R,f° in X° by (20.1.3). Choose we C%(Y x [0,1)) so that 
=1 in a neighborhood of K and ¢=1 in a neighborhood of suppw. 
Theorem B.2.9 or rather its proof shows that 


Ne THUS COS lem + IP TF les 1,0) 


Since (1—yw) Tf° is a continuous function of f €H,) with values in C™(X) 
when supp f< K, we have proved (20.1.15). 

We may also suppose that supp Uc K when proving (20.1.16). By (20.1.5) 
we have 


m—1 
PULY ¥.@Did, = ¥ Po. Uji. 
0 


jtl<m 


Since P,,,,, is of order m—j—1—1 we have 
Vile jlle me je SOD Uj jy: 
The Fourier transform of v,®D} 6 is 6(€') é/, and 
[oUt ey) dc = Clete yee 
when j<m. Thus 
10, @ D3 5 _m.s—teky ll0; @ D2 SU (_ sn, 5) S CMV jMl(s_ ma j4-4> 


if j<m and k is an integer =>max(s,0). Hence 


—1 


[FP Uther SC YE Wile ja 


for commuting T with x’ derivatives it is easily seen that T is continuous 
from Hy _ m4 to Hy, for every integer k2=0. (See also Lemma 21.1.9 below.) 
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Since Ur+PTP°’U=R,P°U is continuous from 9'(dX) to C*, we can 
improve the estimate to (20.1.16) in the same way as in the proof of 
(20.1.15). The proof is complete. 


Remark. The estimates proved here by means of Theorem B.2.9 are in fact 
valid for every T satisfying the transmission conditions. We leave the proof 
for the reader. 
Theorem 20.1.7. The map (20.1.12) is continuous from 

Hy m(X°, F) OH. (OX, G,)@...O His m,—4)(OX, Gy) 


to F(X°, E) for every s2=m. It is an approximate left inverse of (20.1.2) in 
the sense that (20.1.11) is valid with K continuous from H,,,(X°,E) to 
C™(X, E), and an approximate right inverse as well in the sense that (20.1.13) 


and (20.1.14) are valid with 
i C) 
K, K, 


continuous from H@)(X , F) 8 B'(OX , @G;) to C°(X, F) 8 COX, @G;). 
Proof. By (20.1.15) we have 

Neg lees 
so it follows from (20.1.16) and the fact that Sj,e¥*~' that 


[IPS VT N= CW ei: 


(s-m,-— 3 


The continuity of the other terms in (20.1.12) is an obvious consequence of 
Proposition 20.1.6, which completes the proof. 


As already pointed out, Theorem 20.1.2 is an immediate consequence of 
Theorem 20.1.7 and Corollary 19.1.9. Using Theorems 19.5.1 and 19.5.2 it 
would also be easy to show that ellipticity is necessary for the conclusions 
to hold. 


Theorem 20.1.8. If the boundary problem (20.1.1) is elliptic then the kernel of 
the Fredholm operator (20.1.2) is in C~(X, E) and the range is the orthogonal 
space of a finite dimensional subspace of 


C”(X, F* @ Q,) ® C*(EX, Gt @ Qzx)®...B C°(EX, G¥ @ Qzy). 


Thus the index is independent of s. It is also independent of the lower order 
terms in P and in B,, and it is stable under arbitrary small perturbations of the 
coefficients. 


Proof. From the properties of the left inverse in Theorem 20.1.7 it follows 
that if ueH,,,(X°,E) and (20.1.1) is valid with feH,,_,(X°,F), 
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8 EH (._m,-4)(OX,G,) for some s2m, then ueH,,(X°, E). In particular, the 
kernel of (20.1.2) is contained in C™(X, E), and it suffices to prove that the 
range is defined by C® relations when s=m. Thus assume that 


VE Ho X,F*@Qy), jE Hm, 4.4m (OX, GF @ Qqy), 


and that 
(Pu,v)+)(B,u,h,))=0 when ueC(X, E). 


In particular we can define u by (20.1.12) with f=0, 
u=TP*Sg; g,eC*(0X,G)). 


Then Pu=K,g, Bu=g+K,g where K, and K, are continuous maps from 
@' to C~. Thus 


Ye .4)= —(K2 8, v)—(K4g,h) 


is a continuous linear form on g, in the distribution topology, so h,eC™. 
Since 
(Pu, v)= —>i(Bju,h)), ue C™(X, E), 


we obtain P* v=0 in X° by taking u equal to 0 near the boundary. Thus the 
Cauchy data of v are well defined (Theorems B.2.9 and B.2.8), and they are 
in C® since they can be expressed in terms of h. Using Theorem 1.2.6 
(Borel’s theorem) we can define v in X~ X° so that ve C&(X~ X°) has these 
Cauchy data on 0X and the higher order derivatives with respect to x, on 
OX are chosen so that P*v vanishes of infinite order there. After this 
extension ve Hesm?(X) and P* veC®(X). Hence ve C*(X) which completes 
the proof that the range is defined by smooth relations independent of s. 
Stability of the index under perturbations of lower order follows from 
Corollary 19.1.8 and stability for arbitrary small perturbations is a con- 
sequence of Corollary 19.1.6. The proof is complete. 


Theorem 20.1.8 shows that the index of an elliptic boundary problem is 
determined by the topological data defined by the operators involved. In 
Section 20.3 we shall show that the determination of the index can be 
reduced to the same problem on a manifold with no boundary. The proof 
will require some elementary facts on ordinary differential operators with 
constant coefficients which will be discussed in Section 20.2. It will also be 
necessary to generalize the results of this section. First we note that there is 
no need to assume that B, are differential operators. The proofs are un- 
changed if B,=Biy where Bj are pseudo-differential operators from sections 
of E™ to sections of G, on 0X satisfying the ellipticity condition. However, 
we must also allow more general operators P, and that requires more work. 
The most natural setting is to allow P to be any pseudo-differential opera- 
tor satisfying the transmission condition. However, the boundary conditions 
are then far more difficult to state (see the references given in the notes) so 
we shall use another not quite invariant class of operators which has the 
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advantage that its elements are close to being differential operators at the 
boundary. 

We shall assume as above that a neighborhood of dX in X has been 
identified with 0X x [0,1), and that E, F in this “collar” are identified with 
the liftings from 0X. Write X,=0X x [0,6) and define the extension X as 
before. We shall assume that P=P’+P' where Pie ¥",(X°) and the kernel 
has support in (X~ X,)x(X~ X,), say, and 


Pe=¥ PP Di, 
0 


Here m is a positive integer and P’ is a C™ function of x,e(—1,1) with 
values in Y", /(@X) vanishing for x,>2/3 say. We assume that P? is a vector 
bundle map E- F. By definition the principal symbol of P is the sum of 
those of P’ and of P'. As indicated above we allow B; to be of the form 


m-1 


» BD, 
0 


where B,,€ Pr (OX ; E,G,). The principal symbol defined in the obvious 
way is homogeneous of degree m, and is a polynomial in ¢, of degree <m. 
It is clear that Definition 20.1.1 makes sense in this more general situation 
and that we still have a continuous map (20.1.2). To extend Theorem 20.1.2 
we must reexamine the definition and the properties of the parametrix T} for 
we can no longer use a standard pseudo-differential operator. Since our 
present setup is not invariant we shall use a local variant of the arguments 
used to prove Theorem 20.1.2. 

In terms of a local coordinate system on @X the symbol of P in the 
corresponding part of the collar is a sum of a standard symbol of order m 
and a polynomial of degree m in é, for which the coefficient of €/ is a 
symbol of degree m—j in the other variables. Thus the symbol will locally 
satisfy estimates of the form 


(20.1.17) [Dz DE a(x, GS Cyp(d +18)" +E) 


where ¢’=(€,,...,€,_;). The first time we differentiate the symbol of P’ with 
respect to & the term €” drops out since P® is just a multiplication operator. 
Thus da/0¢, satisfies (20.1.17) with m replaced by m—1 for every j and not 
only for j=n. 

Before proceeding with our discussion of the boundary problem we must 
make some remarks on the calculus of pseudo-differential operators. Denote 
by S™”’ the set of all ae C®(IR" x IR”) such that 


(20.1.17) [Dz Df a(x, ES Cyp(1 + [En Ar + [er 


Lemma 20.1.9. If aeS”™ then a(x,D) is continuous from Hy4m.t+m)(R") 
to H,,(R") for all s, teRR. If beS*" then a(x,D)b(x,D)=c(x,D) where 
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cesm+m'+#" more precisely, c—abeS™te™ +" — 1 If a@esm—1™ for every j 
then c—abeS™*e— lim te 


Proof. S™™ is the space of symbols of weight (1+|é|)"(1+|é])™ with respect 
to the o temperate metric 


ldx|? + [de7/(1 +1")? +142, 17/1 +161”). 


Hence it follows from Theorems 18.5.10 and 18. 54 that ceS™*#™ +" and 
that c—abeS™t+#™+"'-1 Hence 


a,, x, D)=(1 +|DI?)°7(1 +|D'P)? a(x, D)(1 + DP)“ S+™2(1 + D'or? 


is in Op S®° and therefore I? continuous by Theorem 18.6.3. This proves 
the continuity statement. 
In order to prove the last statement we write 


b(x, €)—b(y, =>) (x;y) bl, y, 8), 
bi(x, y, Q= byly+ t(x—y), €)dt. 
When yelR” is regarded as a parameter it is clear that b, is uniformly 
bounded in S":“", and so are the y derivatives. Now 
a(x, D) b(x, D)=a(x, D) b(y, D) +) (x;—y,) a(x, D) b,(x, y, D) + C(x, y, D); 
— C(x, y,D)= —i¥ a'?(x, D) b (x, y, D). 


Thus C(x, y,é) and its y derivatives are uniformly bounded in S™+#~ bim'+#’, 
Taking y=x we obtain 


c(x, 6)=a(x, €) b(x, ¢) + C(x, x, &) 


which proves that c—abeS™*#—1-™' +" The proof is complete. 
The following lemma is essentially a case of Lemma 18.4.3: 


Lemma 20.1.10. Let ye C?(R"), aeS”°, and assume that 

la(x, 2c(1+lcl" if Cesupp(1 — x). 
Then b(x, €)=(1—y(2)/a(x, €) is in S~™°. If aAeS™-)° then b®eS-™- 1-9, 
Proof. Assume that estimates of the form (20.1.17)' have already been proved 


for derivatives of order <|x+ | of b. Then we obtain by differentiating. the 
equation ab=1—y that ; 


late, €) DEDE B(x, HS Cyp(t + 1E)- mL +E) 
and this implies that 
[Dz D8 b(x, HIS Cpl HED AL FE). 
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Since 
bY (x, = — a(x, Efa(x, €)7 


outside supp x, the second statement follows from the first. 


Remark. The lemma and the proof remain valid if a is N x N matrix valued 
and a(x, €)~‘ can be estimated by (14+ |é])~”. 


Assuming as we may that P is defined and elliptic on X we can now 
prove the interior regularity result we need: 
Proposition 20.1.11. If ucHem?_,,(X) and PueHe™?(X) then ue Heom?(X). 


(s+m— 


For every compact subset K of X we have if suppuc K 


Hell (s+ my SCOP all cy + WMll (ss m— 1)» 


Proof. Since for every PE CF 


IP bully Sb Pull. + Clulliss m—1) 


it is sufficient to’ prove the statement when u has compact support in a 
coordinate patch. We may assume that the coordinate patch corresponds to 
the set {xeIR";|x|<2} and that suppuc{xeR";|x|<4}. Set a(x,é) 
= p(us(x) x, €) where p is the symbol matrix of P in the local coordinates, for 
some trivializations of E and F, and w is chosen as in Lemma 19.2.11. Then 
aeS™°, aDesm-1° for every j, a(x, &)=p(x,&) in a neighborhood of supp, 
and |é/"a(x,€)~! is bounded for large |é|. Using Lemma 20.1.10 we can now 
choose a matrix symbol beS~™° so that bW(x,&eS~™~'>° for every j, and 
b(x, &) a(x, €) is the identity for large |é|. By Lemma 20.1.9 it follows that 
b(x, D) a(x, D)=1+ R(x,D) where ReS~'°, hence R(x, D) is continuous from 
H to Ay, , m- Thus 


Hulls 4m) D(x, D) a(x, D) ul\s 4. my + | R(x, D)Ullis.m 
S C(la(x, D) ull iy + lullissm— 1): 


This proves the proposition. 


(st+im-— 1) 


Next we shall discuss regularity at the boundary. In doing so we consid- 
er a local coordinate patch where P= P’; we trivialize E there first and then 
trivialize F using the identification of E and F given by multiplication with 
P>; After extending the symbols of P and of B ; to the whole space as in the 
proof of Proposition 20.1.11 we obtain the following situation. The symbol 
of P is of the form 


p(x, )=3 px €) 
0 


where p,<S”~/(R" x R"~'), p,, is the identity N x N matrix and |é|" p(x, ¢)~' 
is bounded for large |¢|. Similarly the symbol of B; is 


m—1 
bx’, éy= > bi lx’, é’) a 
0 
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where b,,¢S" ~““(R"~'xIR"~'). As in Lemma 20.1.10 we define 1(x,é)= 
(1 —x(€)) p(x, £)~* where ye C® is equal to 1 in such a large set that teS~™° 
and teS—™~1-° for every j. Corresponding to (20.1.3) we set 


t(x, D) p(x, D)=1+r,(x,D), p(x, D)t(x, D)=I +r;(x, D) 
where r,,7-€S~':°. Thus the operators r,(x,D) and r,(x,D) are continuous 
from H,,,(R") to H,,,1,)(R") for all s, t while t(x,D) is continuous from 
Hy (R") to Hy... my) QR"). We must also establish estimates similar to those 
in Proposition 20.1.6. If fe(R") we denote as usual by f° the function 


which is equal to f in IR", and 0 elsewhere. Let 7(x,D)f° be the restriction 
of t(x, D) f° to IR", and define 7,(x, D), #,(x, D) similarly. 


Lemma 20.1.12. If feS then 


(20.1.18) ED) ecg mgt Fes DIS lisa ty t+ Fp DVS Misa a. 
SC, lf ly, 520, 
with the norms of Hy,,)(IR"). If SEA, _m,y(IR") and x,=0 in supp f, then we 
have for every v 
(20.1.19)  7(x, D) f lle vy + WF D) F llvs 1 —me—w + lp D) Slog mt») 
SCy vl F ll mn 
where the norms on the left-hand side are in H,_\(IR",) and the norm of f on 


the right-hand side is in Hy _»,)(R"). 


Proof. The essential point is that p(x, é) has a nice asymptotic expansion as 
€,— 00. (Thus it is really the transmission condition that we use.) With the 
notation 


5=€,+ i)? +1)* 
we can write 


p(x, )=)) Px, 6) & 
0 


where P,=I and PeS”~/(R"xIR"~'). Then we can successively choose 
1,€S/(R” x IR"~*) so that for N=1,2,... 


N-1 m—1 
E" p(x) D0 (x 6)E-" J=ENI+ DY py (0,0) 54, 
0 0 


where py_; is in S/*%. In fact, for N=1 this means that tg=I and that p,, 
=P,_1_;- If the identity is established for one value of N then 


N 
B+ D(x, )Y 1x, SE 
10) 


m-1 m 
=ENtTT+ Y py (6) 5) I+) B,_ (x, 6) 51 ty(x, 6) 
0 0 
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which is of the desired form if ty(x, ¢’)= — py, o(x, ¢’) and 
Pu+i,j= Pn, joi tPa—1—j tw g) 


for j=0,...,m—1. If we multiply the identity by t(x,é)=(1—7(2)) p(x, 67! 
and by £~%, it follows that 
N-1 


t(x,6)= Y 2x, €)E-"-S + pyl(x, 6) 
0 


where 
N-1i 


m—1 
pubes f= — 16) Yo ye)" I (1-12) PCH! Dy, 2) ™ 


Since |i f\lossyS lf lli,y, $20, and the norm of f° in Ho,s4n(0R") is equal 
to that of f in Hyo,,,,(R",), the fact that pyeS~"-™™ gives when N2s 


lPn(X,D) fo Wicam.n& lPylx, DF WinccnecnyS CIS Vossen Cif lis 


The continuity of py(x,D) from Hy_»,) tO Hy, when v<N also gives an 
estimate for the term ||py(x, D)f|,,,,_y in (20.1.19). 
For the operator 


E(D)~* u(x) =i-* A exp(—t(\D’|? +1)*)t*~1 u(x’, x, +t) dt(k—1)! 
0 


the restriction of the left-hand side to the half space x,>0 is determined by 
the restriction of u to that half space. Since Op(t,5-"—4) is continuous 
from H,, (IR) to Hy, m,y(IR") we also have continuity in the corresponding 
restriction spaces in IR". This proves the estimate for 7(x,D)u in (20.1.18) 
and (20.1.19) since the terms just discussed give no contribution at all in 
(20.1.19). 

To prove the estimates for 7,(x,D) and ?,(x,D) we shall examine 
p(x, D)t(x, D)—(p t)(x, D) and t(x, D) p(x, D)—(t p)(x, D) using the same de- 
composition of t. First we observe that the symbols of p(x, D) py(x, D) 
—(p py)(x,D) and py(x, D) p(x, D)—(py p)(x,D) are in S~’%~! for every N, 
and for large enough N we can therefore argue in the same way as when we 
estimated 1t(x, D)u. The symbols of 


t,(x, D') 3(D)-"~/ p(x, D)—(«,p-"~/) (x, D) 


and the corresponding difference with the order of the factors interchanged 
are in S~'° for every j. Thus the operator is then continuous from 
Hs,)(R") to Ay, 1,)(R") for all s,t which again implies continuity in the 
corresponding restriction spaces in IR". This completes the proof. 


With P replaced by p(x, D) and T replaced by t(x, D) we now repeat the 
steps which led to the proof of Theorem 20.1.2. First we define the matrix Q 
of pseudo-differential operators by (20.1.7). Only the principal symbol of Q? 
—Q is now equal to 0, which means that Q*—Q improves differentiability 
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by one unit. The proof of Corollary 20.1.4 therefore gives that 


F..y(R" afr Qy r(x, D) FOC dy ep AG 

is continuous for s20. If the hypothesis on Q in Proposition 20.1.5 is only 
fulfilled on the principal symbol level, that is, Q7-Qe¥~', then the con- 
clusions remain valid with = interpreted as equality of principal symbols. 
We can also obtain BQS—I,e¥~~ by taking S=QT where BQ?T 
—Ige¥~°. For our Douglis-Nirenberg systems we thus obtain S,; 
=S,,e¥""™ and Sj,¢*~'. The error term K in (20.1.11) is now continuous 
from A, ,(R".) to Ayy1,)(0R4) when s2m. For the error terms 
K,,.--,K4 in (20.1.13) and (20.1.14) we obtain: 


K, is continuous from H,, ,(R",) to A,,, 1,)(R",) when s20; 

K, is continuous from Hy m—m,—»yOR"~') to Ay4 1,19 (R%) 

for arbitrary s and f; 

K, is continuous from A, )(R".) to OH o414+m—mj+y(R"~ 1) if s=0; 
K, has a C®™ kernel. 


In the verification, which we leave for the reader, it is useful to recall from 
the proof of Proposition 20.1.6 that 


IP? Ulm SCD NG illts_j—a- 


Suppose now that ueH,,)(IR".) and that u vanishes outside some large 
ball. We shall prove that ueH,,(R’,) if 


p(x, D)ueH,,_ IR"), bx, D) ue Hy, m,—3)(R"’). 
In the proof we may assume that we know already that ueH,,,(IR",) for 
some s’>m with s<s'+1. Since 

u=(1+1(x, D) P°S" y)t(x, D) f°+7(x, D) P’Seg+Ku 
the assertion follows from the continuity properties of K above and from 


the continuity of the other operators given by Lemma 20.1.12. 
Next assume as in the proof of Theorem 20.1.8 that we have a relation 


O= f (Pu)vodx+)[ Byu(x’)h(x’)dx’, ueCP(R’) 


X,>O0 


7! 1 -1 
where veH( ,(R*,) and he Hes mj+4—m(R" ). 


We shail prove that this must remain true with t replaced by t+1, hence 
for all t. To do so we take fe CPR") and set 
(20.1.20) u=((I+1(x, D) P°S”y) t(x, D) f° +(x, D) P’ Sg) 
where @ECF(R") is fixed. Then 
|Pu-—Of lo,» t+>, IBju—¢ gyll—t—mj— 44m 
= CS lo, td 2jltee- aaa) 
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by the estimates for K, given above and the continuity of the operators in 
the right-hand side of (20.1.20) which follows from Lemma 20.1.12. Hence it 
follows that 


| J Pfvdxt Li bgshiaxls CUS lo, -4-1)t Y Welt am 
This proves that pveH o, 1+ 1)0R4) and hE He 4m,+3—m) a8 Claimed. Thus 
we have h,eC®. Since ‘PveC™ and the Cauchy data of v are in C® it 
follows as in the proof of Theorem 20.1.8 that veC@(IR" ). 


Theorem 20.1.8’. Let P= P?’+P' where P' is a pseudo-differential operator of 
order m with kernel of compact support in X° x X° and P? is a differential 
operator in x, which is pseudo-differential along 0X in the collar, as described 
above. Assume that P is elliptic and that the boundary operators B, are elliptic 
with respect to P. Then the conclusions of Theorem 20.1.8 remain valid. 


Proof. First we prove that the operator 
(20.1.21) Hy (X°, E)auro (Pu, B, u,..., By we 
Hoy(X°, FO® Hm —m;—4)(OX, G;) 


is a Fredholm operator with range equal to the annihilator of a finite 
dimensional subspace of C™(X, F *@2Q)O® C* (0X, GF @Qzx). Let u, be a 


weakly convergent sequence in Hp(X°, E) such that Pu, is norm con- 
vergent in H(o(X°, F) and B,u, is norm convergent in Hi,_m,—4)(0X,G,) for 
every j. Then u, is norm convergent in i 1(X°, E) so the sequence Pu, 
has the same properties if P¢ECH(X). When dEeCH(X°) it follows from 
Proposition 20.1.11 that du, is also norm convergent in js An © Gs ,£), and if 
supp ¢ is sufficiently close to a boundary point the same conclusion follows 
from the fact that the operator K above is compact from H,,, to Hi). By 
Proposition 19.1.3 this proves that the operator (20.1.21) has finite dimen- 
sional kernel and closed range. Now we have proved that the orthogonal 
space of the range is in C°(X,F* @Q,)®@ C°(OX, Gt @Q,y). (The regu- 


J 
larity in the interior follows from Proposition 20.1.11 applied to the adjoint 
of P.) By Lemma 19.1.4 and the closed graph theorem this implies finite 
dimensionality. If s20 and 


(f,81: ++ BCH g(X°, F)OD Ho m—mj—4)(OX, G)) 
j 


is orthogonal to this space we know that the boundary problem (20.1.1) has 
a solution ueH i (X°, E). We have also proved that the solution is in fact 
in H,4_)(X°,E) in a neighborhood of the boundary; the same conclusion 
follows in the interior from Proposition 20.1.11. The proof is complete. 


Theorem 20.1.8’ contains a global regularity statement which it is useful 
to microlocalize to a result analogous to Theorem 8.3.1. In a preliminary 
step we keep the hypotheses on P and B,. 
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Lemma 20.1.13. Let P and B, have the properties in Theorem 20.1.8’, let u 
and f be in W(X), and assume that 


Pu=f in X, B,u=g, in 0X, j=l,...,J. 
Then ; 
(20.1.22) WE,(u)lax = WEDlex¥ U WF(g }). 
fz 


Proof. It follows from Theorems 18.3.32 and 18.3.27 that the left-hand side 
contains the right-hand side. To prove the opposite inclusion we first choose 
5>0 so that WF(u) contains no element (x,¢) with ¢’=0 and 0<x,<6, 
which is possible since ue. V(X). (See the proof of Theorem 18.3.32.) Let W’ 
be the union of the right-hand side of (20.1.22) and 


{(x, £1); 0<x,S6, (x, )eWF(f) for some ¢,}. 
This is a closed set since WF,(f) is closed. We shall call a symbol 
yeS™(R , x T*(0X)) admissible if x(x, ¢’)=0 for x,26 and x is of order — oo 
in a conic neighborhood of W’. Thus y(x,D’)feC® by Theorems 18.1.36 


and 18.3.32 if y is admissible. 
We can choose s and t so that ue Hie, (X9), hence 


(20.1.23) x(x,D')ueH,,,, for every admissible yeS°. 


The same result follows with t replaced by t—yp if yeS". From (20.1.23) it 
follows that for admissible yeS° 


(20.1.24) Py(x, D')u=x(x, D) f + LP, x(x, D’)] UucH m4 ee 
for [P, x(x, D’)]= ) Dix;(x,D’) where y;¢S"~*~/ is admissible. Hence 
j<m 


(20.1.25) Di x(x, Due As j4.1,1—1) 


if j=m. By induction we obtain the same result for j<m, for if it is already 
proved with j replaced by j+1 then 


Dit! y(x, Due His j.r—1» Di x(x, D')ueH 


(s~ jt) 


by the inductive hypothesis and (20.1.23). In view of TheoremB.2.3 we 
conclude that (20.1.25) holds. When j=0 we have proved (20.1.23) with (s, t) 
replaced by (s+i,t—1). After a finite number of iterations we obtain 
(20.1.23) with some new (s,t) such that s=m. Then 


B, x(x, D))u=x(x',0, D’)g;+[B;, x(x, D’)] ue A. ¢—my+4) 


so. recalling (20.1.24) we conclude using Theorem 20.1.8’ that 
1(x, D')ueH,,, ;,. Thus (20.1.23) is true with s replaced by s+1 and no 
change of t, which means that (x, D')ueC™(X) and proves that the left- 
hand side of (20.1.22) is contained in the right-hand side. The proof is 
complete. 


20.2. Preliminaries on Ordinary Differential Operators 251 
Lemma 20.1.13 can be given a more general form: 


Theorem 20.1.14. Let X be a C” manifold with boundary 0X, and let P be a 
differential operator from C™(X,E) to C*(X,F) where E and F are complex 
C® vector bundles with the same fiber dimension. We assume that 0X is non- 
characteristic. Let B,,...,B, be boundary differential operators, and define the 
characteristic set Char(P;B,,....B))<T*(0X)\0 as the set of all 
(x’,E)eT*(OX)~0 such that either P is characteristic for some (x,é)€T*(X) 
in the inverse image of (x',é') under the natural map, or else the map in 
condition (ii) of Definition 20.1.1 fails to be injective. If u, fe V(X) and Pu=f 
in X°, Bju=g, in 0X, j=1,...,J, then 


J 
(20.1.26) WFE,(u)|gy <Char(P; B,,..., BJUWE,(/laxV U WF(g)). 
J= 


Proof. We may assume that X =IR", and have to show that (0, €)¢WF,(u) if 
(0, €) is not in the set on the right-hand side. Dropping some superfluous 
B,;s we may assume that the map in condition (ii) of Definition 20.1.1 is 
bijective at (0, €). Write 


P(x, )=S P(x, &) G, 
0 
and set 


m~1 
P(x, €)= By(x) of + yy (B(x, 2) W(F) + BO, 29 1eV1Sol) A -w(e’) 2 


where weC™(IR"~') is homogeneous of degree 0 in & when |é'|21 and 
equal to 1 in a conic neighborhood of &. If the support of w is small 
enough then P is elliptic in a neighborhood of 0, and the boundary problem 
(P; B,,...,B,) is elliptic too if B, are defined similarly. Now Pu=f in X°, 
B,u=8, in 0X where 

(0, 2)¢ WEF —f)v) WF(g;—-2) 


so it follows from Lemma 20.1.13 that (0, €)¢WF,(u). The proof is complete. 
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Let E be a finite dimensional vector space over € and let Ae W(E, E). Then 
the differential equation 
(20.2.1) Du—Au=0 


where ue C™(IR, FE) and D= —id/dt has dim E linearly independent solutions. 
The solution with u(0)=u, is given by 


u(t)=(2ni)-! $(2I—A)-! uy ef? dz=¥ Res(zI — A)” ug e', 
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where the contour integral is taken along the boundary of a domain 
containing the eigenvalues of A. If A has no real eigenvalues then u(t) 
=u_,(t)+u_(t) where u, is the sum of the residues in the upper half plane 
and u_ is the sum of those in the lower half plane; u, is bounded on R, 
and u_ is bounded on R_. Thus 
(20.2.2) q= > Res(zI—A)~' 
Imz>0 

is the projection on the space M* of solutions bounded on R_, annihilating 
the space M~ of solutions bounded on R_, if solutions are identified with 
their values at 0. 

Without violating the “ellipticity condition” that A has no real eigenval- 
ues one can deform A to a very special form: 


Proposition 20.2.1. If A has no real eigenvalues and q is defined by (20.2.2), 
then 


(20.2.3) A,=(1-—1)A4+1t(iAq—id(I—q)) 

has no real eigenvalues for OStS1 if A>0, and 

(20.2.2)' q= > Res(zQI-A)-', OStS1. 
Imz>0 


Proof. Let q, be the residue of (z]—A)~* at z. Then g,=0 unless z is an 
eigenvalue, and q, is then the projection on the space of generalized eigen- 
vectors belonging to the eigenvalue z along the space spanned by the other 
generalized eigenvectors. Thus the projections gq, commute and 


q= )) a. 
Imz>0O 
The restriction of A—zI to g,E is a nilpotent operator and the restriction 
of q is the identity (zero) if Im z>0 (Imz<0). The restriction of A, to q, E is 
thus the sum of a nilpotent operator and the identity times 


(1—t)z+tidsgnImz, 
which has an imaginary part of the same sign as Imz. Thus A, cannot have 
a real eigenvalue when 0<1<1. The spaces of generalized eigenvectors do 
not depend on 7 although the eigenvalues do. The proof is complete. 


The interest of the proposition is that on one hand Ap is an arbitrary 
operator with no real eigenvalues and on the other the homotopic A, is 
very special. We can write down the solutions of the equation Du— A, u=0 
explicitly, 

u(t)=exp(—At)quy+exp(At)(I—q) up. 


Higher order operators are not as easy to deform as first order ones. We 
shall therefore discuss a reduction to first order systems. Let for some m>1 


(20.2.4) p(z)=)> pj! 
oO 
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where p,¢#(E, E) and p,,=I. Every solution of the equation p(D)u=0 can 
be written in the form 


u(t)=(22i)~* $p(z)* U(z)e*dz=¥ Res(p(z)~ ' U(z) e*”) 


with a unique polynomial U(z) of degree <m with values in E. In terms of 
the Cauchy data U is given by 


U(iz)= Yo pj. Di *u(0)z*. 


kSj<m 
(Cf. (20.1.5).) 

In the standard reduction to first order systems one simply introduces 
Diu(t), O<j<m, as new unknowns. However, we must not introduce any 
real eigenvalues and must therefore as in Proposition 20.2.1 use the oper- 
ators D+id and D—iA/ instead for some A>0. First note that we can write 


(20.2.4 p(2)=S.a(2— Ai 2+ Air, 
0 


With (z—Ad)/(z+A)=w, that is, z=/Ai(w+1)/(i—w), this means that 


m 


Ya,wi=(1—w)/22i" p(di(l + w)/(1 —w)). 


0 


Thus we conclude that for some constants c ;, 


(20.2.5) a= 


k—m 
j Cin De A . 


of): 


If p(D)u=0 and we set 
u,=(D—Aiy(D+Ai"* Ju,  OSj<m, 


it follows that we obtain a system of first order equations 
m—1 
An (D—Ai)Up_ 1+ Y a(D+Ai)u,=0, 
0 


(D+Ai)u,-(D—Ai)u;_,=0, O<j<m. 


This is essentially the usual procedure modified by the Mobius transfor- 
mation above. 

In order to make a smooth transition to the original equation we define 
for 0S$tS1 a polynomial P(z) with values in Y(E”, E”) by the block matrix 


(20.2.6) 
pz) OF .. Ro(t,z)  Ry(t,z)... ee R,,_ (t=) 
0 M,(z) 0... —tM,(z) 0 
0... TL 90 ~My (2) 0 


0 Lee M,(z) 0 0 .. —TM,(z) 0 
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Here M,(z)=(z+4i", M,(z)=(z—4i)(z+Ai)""', and 
Ro(t, z)= —t™ p(z) +1” ay M9 (z), 
R,(t,z)=t"-4a;Mo(z), O<j<m-—1, 

M,(z)+1a,,M ,(z). 


The coefficient of z” in P(z) has determinant 1, and since a)4+ ...+4,,=J an 
easy calculation also gives that it has the factorization 


R,,_ 1(t,2)=Ta 


m—1 


Tot (ayt ce +q) ee Tn 1 $4 yn) | ee | Seer 
0 I 0... —tIl I 0 
0 ae 0 I O° ik —tl 1] 


Thus the coefficients of the inverse are just linear functions of ay,...,a,,, and 
this remains true if ay,...,4,, are replaced by more general operators. 

Note that the second term in (20.2.6) vanishes for t=0 and that F(z) 
becomes of first order in z if one factors out (z+/i)"~'. Let us now 
determine the solutions U=(Up,...,U,,_,) of the equation P(D)U=0. Ex- 
plicitly the equation is 


(1 —t”) p(D) Up +(D+ Ai)" (ag 1" Ug t+... +4__1 TU, _ 1) 
+1(D+Ai)"—'(D—Ai)a,, U,_,=0, 
(D+ Ai" U,=t(D+ Ai" '(D—-Ai)U,_,, O0<j<m. 
If U is a solution which is bounded on R, then the derivatives are also 


bounded so we can cancel a factor (D+/i)"~1 because (D+Ai)f =0 has no 
solution bounded on R.,, other than 0. Hence 


(D+Ai)U,=t(D—Ai)U,_ 1; (D+Aiv Uj=7(D—Aiy Up. 


The first equation can now be rewritten p(D) U, =0. 

If det p(z)+0 when zeR then the space M+ of solutions of P(D) U=0 
which are bounded on R, is thus obtained by taking U, in the set M* of 
solutions of p(D)u=0 which are bounded on R, and taking for U, the 
exponentially decreasing solution U, of 


(D+ Ail U;=74(D—Aiy Up. 


In particular, it follows that det P(z)+0 when zeR. 
Suppose now that we have a set of boundary operators 


(20.2.7) B(D)= ¥. byDiy fated, 
ksmj 


where b,,¢2(E,G,;) for some other finite dimensional vector space G,. 
Assume that 


(20.2.8) M* 5ur>(B,(D)u(0),...,B,(D)uO)eOG, 
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' is bijective. Then it follows that 
(20.2.9) M7 3Ur(B,(D) Up (0), ..., B,(D) Uj ONEDG, 
is also bijective. 

Assuming that m,<m for every j we can write 


ee) By()= Y Byle—Aie+ aint 
k<m 


Here f,, can be computed from the identity 
> Bi W* = ((L — w)/2 i" * BAIL + w)/(1 — w)) 
which shows that for some constants c;, 
(20.2.10) Bye >, CybyAt -™. 
lam; 
For UeM*+ we have 
t(D —Ai(D+Ai"—1-* Uy =(D+Ai"—! U,. 
If 7 +0 it follows that 


m—1 
(20.2.11) B,(D)Uy= ¥) t*By(D+Ai"™* Y,. 
0 
For reference in Section 20.3 we sum up the preceding conclusions: 


Proposition 20.2.2. Let p(z) be defined by (20.2.4) where pje £(E, E) and p,,=1. 
Assume that p(z) is invertible for real z. Define aye L(E,E) by (20.2.4) or 
equivalently by (20.2.5), and define a polynomial P(z) with coefficients in 
L(E", E™) by (20.2.6). Then the coefficient of z™ has determinant 1 and the 
entries in the block matrix for the inverse are linear in do,...,a,3 P(z) is 
invertible for real z, and the projection of solutions of P(D)U=0 on the first 
component is a bijection of the set My of solutions bounded on R., on the set 
M* of solutions of p(D)u=0 bounded on R.,. If B,(D) is of order m;<m 
with coefficients in £(E,G,), then (20.2.8) is bijective if and only if (20.2.9) is 
bijective. When t +0 we have (20.2.11) if UeM? and Bj, is defined by (20.2.7) 
or equivalently by (20.2.10). 

Remark. If P,, is the coefficient of z" in P then the preceding conclusions 
hold for P;,' P as well. For this operator the coefficient of D™ is also the 
identity. When 7=0 this does not affect the operator P. 
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As in Section 20.1 X will denote a compact C* manifold with boundary ¢X, 
and E, F will be two C® complex vector bundles on X of the same fiber 
dimension. We assume that a neighborhood of @X in X has been identified 
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with 0X x [0,1) and use the notation (x’,x,), x’eeX and 0Sx, <1, for points 
in this collar. There we also choose an identification of E,..), Fy,x,) with 
Ev,0)> Fox,o): We shall discuss the index of elliptic boundary problems for 
operators P of the form P=P’+P' where Pie i (X°; E,F) has a kernel of 
compact support in X° x X°, and 


p?=¥ PP Di. 
0 


Here m is a positive integer and PP is a C® function of x,e(—1,1) with 
values in Y"-/(@X) vanishing for x,>4, say; P? is a bundle map. We define 
the principal symbol and ellipticity as explained in Section 20.1. Next we 


determine the index in a very special situation: 


Proposition 20.3.1. Let E=F and assume given a decomposition E|,,= 
Et ®E-. Choose dE CH((—1, 1)) so that OS ¢S1 and ¢=1 in a neighborhood 
of 0, and consider @(x,) as a function on X with support in the collar. Choose 


Ate! (aX,E*,E*), Ae W1(X°:E, BE) 


phg phg 


with principal symbols equal to 4* and 1 times the identity, where A* >0 in 
T*(0X)\0 and A>0O in T*(X°) 0. If ueC*(X,E) the restriction to the 
collar can be written u=(u*,u~) where u* is a section of E*, and we define 


(20.3.1) P’u=@(x,)((D,+iA*)ut,(—D,+iA Jur). 
Set P'u=i(1—@(x,)) AL —(x,))u, P=P? +P! and 
Bu=u7 |,x¢C”(0X, E-). 
Then the boundary problem (P,B) is elliptic and the index of the correspond- 


ing operator H,,(X°,E)> H,._ :)(X°, E)®H,,_ ,)(0X,E) is equal to 0 when 
s=l. 


Proof. If (y,n)eT*(0X)~0 then the solutions of the equations 
(D,+iA*(y,m))u* =0, (—D,+iA-(y,n))u- =0 


which are bounded on R, are u+=0, u~ =up exp(—x,A~(y,7)), so the 
ellipticity is obvious. To determine the kernel, which we know is in C™, we 
choose a hermitian structure in E so that E* and E~ are orthogonal in the 
collar. We also choose a positive density in X which in the collar is the 
product of one in dX and the Lebesgue density dx,. If ueC™(X, E) then 


2Im(P? u, u)=((P? u, u)—(u, P? u))/i 
=2Re((pAt ut,ut)+(PA u-,u-)4+ (GD, ut ,u*) 
—(ut,oD,u*)—(PD,u-,u-)+(u-, Du fi. 


By a partial integration we move the operators D, to the other side and 
obtain in view of the positivity of A* and A~ 


21m (P? u,u)= llu* [13x — hum Idx — C (lull. 
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Since Re P' is also bounded from below it follows that 
2im(Pu,u)= —C'(u, uv), 


if weC™(X, E) and Bu=0. Thus the kernel of (P+itI, B) is equal to {0} if 
2t>C’. Replacing P by P+itI we assume in what follows that the kernel is 
equal to {0}. 

The hermitian structure and densities introduced above allow us to 
regard the adjoints of A and of A* as operators in the same bundles. 
Assume now that veC™(X,E) and heC”(éX,E7) are orthogonal to the 
range of (P, B), that is, 


(v, Pu)y +(h, um )5x =0, ueC™(X, E). 


Then we have P*v=0 and v*=0 on 6X since there is no boundary 
condition on u*. Now — P* is of the same form as P but with the roles of 
E- and E* interchanged. If the number t above is chosen large enough it 
follows that v=0, hence that h=0. Thus the index is equal to 0, and the 
proof is complete. 


Note that the positivity of the imaginary part of the symbol makes all 
operators considered in Proposition 20.3.1 homotopic when FE, E+ and E7 
are fixed. We shall now show that the index theorem on manifolds without 
boundary allows one to handle general symbols in the interior of X. Thus 
we keep the assumption that there is given a decomposition E|,,=E* @E-. 
We also assume that E is identified with F in the collar 0X x [0,1) but do 
not require that this is true in all of X. As before P® is defined by (20.3.1), 
but P! is now any operator in Pagid 32.F) with kernel of compact support 
in X° x X° such that P= P+ P’ is elliptic. To determine the index we shall 
form the double X of X consisting of two copies X, and X, of X identified in 
OX. The C® structure is defined by identifying the collar in X, with 6X 
x [0,1) and that in X, with 0X x(—1,0] by changing the sign of the x, 
coordinate. On X we “have a bundle E obtained by gluing together the 
bundle E in X, and in X,, and a bundle F is obtained by gluing together 
the bundle F i X, with the bundle E in X,. This is possible and the vector 
bundle structure is obvious since in the collars both bundles are obtained 
by lifting the same bundle from @X. We keep the definition (20.3.1) of P® in 
the double collar dX x(—1,1)c X and choose P? 2E Pig(X95E, E) as in Prop- 
osition 20.3.1. If we regard P} as an operator in X then P= P’+ P' +P} maps 
sections of E on X to sections of F, and P has an elliptic principal symbol p 
in our present sense although P is not quite a pseudo-differential operator. 


Proposition 20.3.2. Under the preceding assumptions P is a Fredholm operator 
A(X, E)> Hy_1)(X,F) for every s2=1. The index of P is equal to the index 
of the boundary problem (P, B) and also equal to s—ind p. 


Proof. We can rewrite the equation 


(20.3.2) Pi=f 
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where ie H,(X, E) and feH,.. (X, F) by regarding the restrictions of # and 
of ie to X, and to X, as sections Uy ,U,EH,.(X, E) and feHy. 1(X, F), 
f,€H,_ 1)(X, E) such that 


(20.3.2) Pu,=f,, Pu,=f, im X; up=uy, up=uz in Ox. 


Here P, has the form in Proposition 20.3.1 but with the roles of E* and E~ 
interchanged since D, is replaced by —D, when the sign of x, is changed. 
The boundary operator 


B(u,,u,)=(ug —tuft,uy —tuz) 


is elliptic with respect to the system in (20.3.2Y for any t; the boundary 
conditions in (20.3.2) are obtained when t=1. In fact, for the bounded 
solutions of the ordinary system of differential equations which occurs in 
Definition 20.1.1 we find as at the beginning of the proof of Proposi- 
tion 20.3.1 that uf =uy =0, and the boundary conditions then give uy =u} 
=Q. The index is therefore independent of t, and when t=0 it is equal to 
ind(P, B) by Proposition 20.3.1 for then there is no coupling between u, and 
u,. Thus the index of the elliptic boundary problem ((P, B), B,) is equal to 
the index of the boundary problem (P, B). 

Now the index of the operator (20.1.2) is equal to the index of the 
operator 


{ueH,,(X°, E); Bu=O}aut+ PueHy,_ »(X°, F) 


provided that the equation Bu=g alone can be solved for any geC™, for 
this restriction of the domain does not change the kernel and gives an 
isomorphic cokernel. In our situation the map(u,,u.)'-> B,(u,,u,) is ob- 
viously surjective so this observation is applicable. Hence we conclude that 
P is a Fredholm operator, and the index is equal to the index of (B, P) since 
we can return from (20.3.2)' to (20.3.2). 

By Lemma 19.2.6 we can choose a sequence Af e¥),,(X; E, F) with prin- 
cipal symbols converging to those of A* so that A*—A}>0 in 
LH (X, E), Ay i(X, F)). For s2=1 we conclude for large j ‘that the 
pseudo-differential operator PB obtained when A* is replaced by Aj is a 
Fredholm operator with the same index as P. In view of Theorem 19.2.4 it 
follows that ind P=s—ind p, which completes the proof. 


The boundary problems covered by Proposition 20.3.2 may seem very 
special but the homotopy invariance of the index gives it a surprisingly wide 
scope: 


Proposition 20.3.3. Let (P,B) be a first order elliptic boundary problem with B 
of order 0. Then the direct sum with a suitable operator of the form studied in 
Proposition 20.3.1, with E=E*, is homotopic to a boundary problem for which 
the index is given by Proposition 20.3.2. 
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Proof. As usual we may assume that E,=E,,. 9, in the collar. The coefficient 
P? of D,, in P? is an isomorphism of E on F in 0X x [0,c] for some ceé(0, 1). 
We can use it to identify E and F then. Thus 


P? =D, — A(x,) 
for OSx,Sc, where A(x,)e ¥,,(0X; E, E). Replacing A(x,) by 
(1—-tW(x,)) A(x,) +t U(x,) AO), OStS1, 


we obtain a homotopy of elliptic boundary problems if y=1 in a neigh- 
borhood of 0 and vanishes outside another small neighborhood, 0<wW<t. 
Changing c also we may therefore assume that A is independent of x, for 
O<x,<c. We may also assume that the support of the kernel of P' does not 
meet X x (0X x [0,c]) or (0X x [0,c]) x X. 


Step I. Let a be the principal symbol of A and let 


q(y,n)=(2 ni) 1 J* (zl —aly,m))"* dz 
be the Calderén projector. It is a C® projection map in E lifted to 
T*(0X)~0 and is homogeneous of degree 0. Choose Qe Pi g(OX ; E, E) with 
principal symbol gq and Ae Pn g(OX ; E,E) with principal symbol equal to a 
positive multiple 4 of the identity. With ¢e€C?((—c,c)) equal to 1 in a 
neighborhood of 0 and OS @<1 everywhere we define P’=P’ when x,2c 


and 
P’=D,—(1-1¢)A—t@GAQ-iA(I-Q)) when x,Sc. 


By Proposition 20.2.1 P=P’+ P' is then elliptic and the Calderén projector 
is equal to q for all te[0,1]. If (P,B) is elliptic it follows that (Rf, B) is 
elliptic with the same index. Note that 


PP=P?+(1—9)P*; P’=$(D,-i AQ+i AU —Q)). 


The kernel of (1—¢)P® has compact support in X° x X° so using Lem- 
ma 19.2.6 we can find a genuine pseudo-differential operator P' with kernel 
of compact support in X° x X° and symbol so close to that of (1—@) P?+P' 


inet (1—-1)P+1P 
is elliptic for O<tS1 if P=P*+ P’. At the boundary these operators do not 
depend on 7 so (P, B) is elliptic with the same index as (P, B). 


Step II, We can now start with an operator P such that 
(20.3.3) P’=$(D,-iAQ+iAU—Q))=¢(D, +i AI —2Q)). 


The difference between this situation and that in Proposition 20.3.2 is that 
the “projection operator” Q in (20.3.3) is a pseudo-differential operator and 
not as in Proposition 20.3.2 just a bundle map on 0X. Let Be ¥},,(0X; E,G) 
be our boundary operator. (Since P is of first order the boundary operator 
only acts on the boundary values, that is, it does not involve D,.) That (P, B) 
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is elliptic means that the principal symbol b(y,7) is bijective from the range 
of q(y,y) to G, for all (y,n)eT*(OX)~0. Recall from the proof of Proposi- 
tion 20.1.5 that this is equivalent to the existence of a (unique) map s(y,y): 
G,—> E,, 9) Such that 


(20.3.4) b(y,n)sy.n=Ig,,  a(y.n)s.m=sy.n), 
s(y,n) b(y,n) a(y,n)= a(y, 1). 


The last equality follows since sbs=s by the first one, the range of s is 
contained in that of q by the second one, and the equality of dimensions 
implies that they are equal. For 0<@< 72/2 we now define 


bo(y,n)=(cos 6 b(y,n), sinO Ig):  Eyo,®G,>G,, 


8 s(y, 
so (= S(y,n) 


sin 0 Ig a a 
(cos 8)? q(y,n) sin 6 cos @ s(y, ” 


ano" (sn acord biy.naQn) (sin 8)? Ia, 


Direct calculation shows that q,{y,7) is a projection operator in E, »@G, 
and that 


(20.3.4) bg Sp=Le,> JoSe=5e, Sy be4g=4o; 


the last equality follows as above since the range of the projection q, has 
dimension dim G, which implies that s, and q, have the same range. When 0 
varies from 0 to 2/2 we obtain a homotopy connecting the projection q(y,7) 
@0 to the projection on the second component. 

To exploit this homotopy we must first choose a vector bundle H on X 
such that H|,,=G@G' for some vector bundle G’ on 0X. This is possible 
by Lemma 19.2.14 which shows that we can take H= X x €% for sufficiently 
large N. The same decomposition of H is then valid in the collar. Now 
define B, by Proposition 20.3.1 with E,F,E* replaced by H and E~ absent. 
Then F, (with no boundary condition) has index 0. We define an operator 
P, from sections of E@H to sections of F® H by 


(20.3.5) Blu, v)=(P u, B, v), 
if u is a section of E and v is a section of H, and we set 
(20.3.6) Bolu, v)=Bu. 


The boundary problem (R,,B,) is then elliptic and has the same index as 
(P,B). We shali prove that it is homotopic to a boundary problem of the 
form studied in Proposition 20.3.2. 

Let q, be the projection HG which is well defined in the collar, and 
choose a decreasing function 9(x,) such that $(x,)=1 in supp@ and 6(x,)=1 
in a neighborhood of 0. For OStS2/2 and ueC™(X,E), veC”(X, H) we 
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now define 
P?(u, v)= o((D,,u, D, v) + i(A(u—2(cos t 6)? Qu—sin 21 OS qv), 
A,(v—sin 276 BQu—2(sin 9) gg v))). 
Here Se'P9 


*ng(0X;G,E) has principal symbol s, and A,e¥,(0X;H, H) has 
principal symbol / times the identity and is assumed to be the operator 
which appears in the definition (20.3.1) of B?. With Pi equal to the direct 
sum of the interior pseudodifferential operator terms in P and in B, we set 
P= P?+ P} and 

B,{u,v)=cost But+sint gv. 


When 1t=0 these definitions agree with (20.3.5), (20.3.6), and (P, B,) is elliptic 
for every t in view of (20.3.4)’. The index is therefore independent of t. Now 


we have 
PP (u, v)=(D,ut+iAu, D,v+iA,g(v—2q¢0v)) 


in a neighborhood of 6X, and 
Bya(u, v) =4q. 


This is of the form considered in Proposition 20.3.2. (When we identify E 
with F as here using the coefficient of D, in P’ then there is a change of 
sign in the second component of (20.3.1) which makes it agree with P®, for 
the decomposition E@® H=(E@G)@G on 0X.) As at the end of Step I we 
can cut P*, off and add an approximation for the rest of this operator to 
the interior one, and this completes the proof. 


Remark. It should be observed that the homotopy of P® to the simple form 
obtained in Step II assumed that an elliptic boundary problem was known. 
There is a topological obstruction to the construction of such a homotopy, 
so elliptic boundary problems do not exist for every elliptic P. If B is of 
order +0 we can of course apply Proposition 20.3.3 after multiplying B to 
the left by an elliptic operator in 0X of order — yp. 

We shall now study some operators P of order m>1 which are very 
close to first order operators. As usual we identify E, with E,, 9) in the 
collar, and we use the coefficient of D™ in P? to identify F with E in a 
neighborhood of GX. Let A+ and A be as in Proposition 20.3.1 when E* 
= FE, and assume that 


(20.3.7) P’u=(D,—A)(D,+iAt)"-! 


for some Ae Png(OX ; E, E) when x, is small. Also assume that P is elliptic 


and that each boundary operator B, factors in a similar way 
(20.3.8) Byu=B,(D,+iA*)"-*u 
where Bye pit? —™(0X; E,G)). 


phg 


Proposition 20.3.4. Let P’=@(D,—A) where ¢€C%(R), 0S dS1, d=1 ina 
neighborhood of 0, and (20.3.7) is valid in supp. Choose weC?(—1,1) with 
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O<wWS1 and y=1 in supp ¢. Let P=P*+P* where Peet ,(X°; E, E) and the 
kernel has compact support in X° x X°. If the principal Synbol of P multiplied 
by (W(E, +iA*) +i. —)? A)" * is sufficiently close to that of P it follows that 
(P, B) is elliptic with the same index as (P, B). 


Proof. The ellipticity of P is obvious. To prove the ellipticity of the bound- 
ary conditions we observe that if a, b; are the principal symbols of A, B,, 
then the solutions of the ordinary differential equation 


(D,,—aly,n))(D,+iA*(y,m)y""*u=0;  (y,nleT*(6X)NO; 


which are bounded on R, are by assumption mapped bijectively on @G, 
by 

> (b,(y, 9) 0(0), ...,b,(y,0) 00) 
where v=(D,+id*(y,n))"~’u satisfies (D,—a(y,n))v=0. Conversely, if v is a 
solution of this equation which is bounded on R, then v=(D, 
+iA*(y,n)""'u for a unique u bounded on R_,. Thus (P, B) is elliptic. By 
Proposition 20.3.1 the operator Q, 


Qu=(D,+iAt)ut+i(l—w Ad —w)u, 


has index 0 as operator from H,, to H,,_,, for s21. Hence it follows from 
Corollary 19.1.7 that for s2m the operator 


H,(X°, E)aut+(PQ™” 1 u,B, O"~'u,...,B,Q"~ we Hy, _m(X°.F) 
OH _ mi- (OX, G, )®.. OH, my— (OX, G,) 


has the same index as (P,B). The boundary operators are equivalent to B;. 
By Lemma 20.1.9 PQ”-! is the sum of a compact operator and an operator 
in our standard class with symbol close to that of P. Hence the stability 
properties of the index show that the index is equal to that of (P,B). The 
proof is complete. 


Using Proposition 20.2.2 we shall finally reduce an arbitrary elliptic 
boundary problem to the type handled by Proposition 20.3.4. In this final 
reduction we still assume that E,=E,,.5) in the collar, and we identify E 
with F in a neighborhood of 0X so that 


P> se Di 
with P? equal to the identity for 0Sx,,Sc, say. pray Q as in the proof of 
Proposition 20.3.4 and set for sections U= (Up,.--,U,,_ 4) of E” 
P,U=(PU,,Q" U,,...,0" U,,_ 1)5 Bi U=B, Uo. 


This defines a Fredholm operator in the usual spaces with the same index 
as that of (P, B). The operator Q” is not quite of our standard class though. 
However, we can replace it by an operator Q,, which near the boundary 
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keeps the form (D,+iA*)” and differs in principal symbol and in norm so 
little from Q” that the index is not affected. We shall deform the operator 
¥, so modified, and the boundary operators as well, so that a new bound- 
ary problem is obtained which satisfies the hypotheses of Proposition 20.3.4. 
When 0Sx,Sc we define with c,, as in (20.2.5) 
A,=)¢,,R° F"~*e P° (6X; E, E) 
0 

where F is a parametrix of At. Since (20.2.4), (20.2.4) is equivalent to 
(20.2.5) we have 


9) 
so Aj+...+A,,=I. We have for OS x,Sc 
PP=)(A,(D,—-iAt)(D,+iA* "4 
(0) 


apart from terms of order —0oo in 0X and order <m with respect to x,. 
Without changing the index of (P,B) we can change the definition of P so 
that this is an exact equality. Let yeC?(—c,c) be equal to 1 in a neigh- 
borhood of 0, OSyS1 everywhere. Now we define # by (20.2.6) with + 
replaced by ty, p(z) replaced by P’, a; replaced by A,, My replaced by 
(D,+iAt)" and M, replaced by (D,—iA+)(D,+iA*)"~*. As pointed out in 
Section 20.2 the coefficient #®, of D™ has an inverse whose block matrix 
entries are just linear combinations of the A,. Without changing anything 
outside supp y we can thus define 


P =(P,)-'P,  VSx,Se. 


n 


Since this is equal to A? when y(x,)=0, we can continue the definition by 
taking P’= > for x,2c. Finally we define A as the sum of # and the 
direct sum of the interior pseudo-differential terms in P and in Q,,, taken 
m—1 times. 

By Proposition 20.2.2 the boundary problem (#,@°) is elliptic for 
O0StS1. When t=1 we can use (20.2.10) to choose Be ymitl—m ax; E™, G;) 
such that @9 for OS« <1 may be replaced by 


BU =(1—x)B, Uy +k B(D,+iAty"-!U 


because the principal parts of these operators all act in the same way on the 
functions occurring in Definition 20.1.1. Thus the index of (P, B) is equal to 
the index of (A,B), and this is a problem of the type discussed in 
Proposition 20.3.4, because (D,+iA*)"~' factors out on the right both in 
the operator A and in the boundary operator @'. 


We end this section by summing up the steps required to determine the 
index of an elliptic boundary problem: 
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(i) If m>1 one takes the direct sum with the m™ power of the operator 
in Proposition 20.3.1 with E* = E, repeated m—1 times, and then deforms to 
an operator and a boundary condition which can be factored to the right by 
such an operator raised to the power m—1. This factor is then cancelled 
(Proposition 20.3.4) so that one is left with a first order operator; the order 
of B is then reduced to 0. 

(ii) When m=1 one can use Proposition 20.2.1 to find a homotopy to a 
system of the form (20.3.3) where Q is a pseudo-differential projector. (This 
is Step I in the proof of Proposition 20.3.3.) 

(iti) After taking the direct sum with an operator provided by Proposi- 
tion 20.3.1, with no boundary condition and containing the bundles where 
the boundary operators take their values, one can by a homotopy change 
the Calderén projector to a bundle map and change the boundary condition 
similarly. (Step II in the proof of Proposition 20.3.3.) 

(iv) One can now double the manifold (Proposition 20.3.2) to determine 
the index. 

We have developed the arguments in the opposite order since each step 
seems better motivated then. 


20.4. Non-Elliptic Boundary Problems 


In the introduction we indicated how the solution of the Dirichlet problem 
for the Laplacean can be used to reduce general boundary problems to 
pseudo-differential operators in the boundary. We shall here outline a 
general form of this procedure. 

Let X be a C® manifold with boundary, E and F two C® complex 
vector bundles on X, and P: C®(X,E)>C(X,F) an elliptic differential 
operator of order m. Assume that 


B;: C*(X,E) > C™(0X,G,),  j=l,...,J, 
define an elliptic boundary problem for P as in Definition 20.1.1. Let now 
C,: C°(X, E) > C°(0X,H,), k=1,...,K, 


be another set of boundary differential operators, of transversal order <m 
of course. We want to study the boundary problem 


(20.4.1) Pu=f in X; C,u=h, in 0X; k=1,...,K; 


where f is a given section of F on X and h, is a given section of H, on 0X. 
Suppose that u is a solution and set — 


Byu=g,, j=l Ie 
Then we obtain using (20.1.11) 
C,u=C,(1+ TP’ S" y) Tf°+ C, TP’ Sg+C, Ku. 


20.4. Non-Elliptic Boundary Problems 265 


We can write C,=Cjy where C; is a differential operator from C™(0X, E”) 
to C*(6X, H,). Thus 


(20.4.2) Ci QSg=h,—C,(1+TP’S” y) Tf°—C,Ku 


where C,KueC® and the other terms in the right-hand side are known. 
Here we may replace QS by S if we add another C® term on the right-hand 
side. 
On the other hand, let us try to find a solution of (20.4.1) of the form 
(20.1.12) 
u=(1+ TPS” y) Tf°+TP*Sg. 


Then we have (20.1.13), and 

C,u=C,(1+ TP’ S" y) Tf°+ Ci OSg. 
To solve the boundary problem (20.4.1) is therefore equivalent to solving 
(20.4.3) f+K,f+K,g=f; CiQSg=h,—C,U+TP‘S’y)Tf°. 


The first equation gives f—feC~(X,F), so the right-hand side in the 
second equation is h,—C,(I+ TP°S” y) Tf° mod C®. Thus we can essentially 
reduce the study of the boundary problem to the study of the pseudo- 
differential operator (Cj QS,...,C% QS) from sections of @G;, to sections of 
@H, on 0X. We write down two cases of this principle, denoting by m, and 
by y, the degrees of B, and of C,. 


Theorem 20.4.1. Let m<s Ss,. Then the following regularity properties are 
equivalent: _ _ 
(i) If ueH,,)(X°, E), Pue Ay, _)(X°, F) and 


C,ueH \(@X,H,), k=1,...,K, 


then ucH,,,)(X°, E). 
(ii) If 8 j;€ Hn m;—4)(OX, G;) and 


COS 8H s,m, -y(OX,H,), k=1,...,K, 
»(OX,G). 


(si—uk-¥ 


then g,cH 


(So—mj— 


Proof. (i) = (ii). With g, as in (ii) we set u=TP°Sg. Then UCH yy (X°, E), 
PueC™(X, F), and 
C,u=C;, OS gEH is, uy, 4)(OX, H,). 
By condition (i) this implies that ue H, (so)(4 °, E), hence 
Bue H (.,_m,—4)(OX, G)). 


Now B ju—Bje c™ by (20.1.14) $0 gj;€H(.,m,—3) Which proves (ii). 
(ii) = (i). With u as in (i) we set g,=B,u, j=1,...,J. Then (20.4.2) gives 


COS geH.,_ y,4(OX,H,), k=1,...,K, 
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SO 8;€ is. m;— y(0X,G,) by (ii). From (201.11) it follows now that 
ueH,,..(X°, F), which completes the proof. 


In the same way one proves: 


Theorem 20.4.2. Let m<s )Ss,. Then the following existence statements are 
equivalent: e 

(i) For arbitrary SEH, m(X°, F) and h,€H,,_ y,-4)(OX,H,), K=1,...,K, 
one can find uéH,,.)(X°,E) with Pu— feC™(X,F) and C,u—h,eC™(0X, H,), 
k=1,...,K. 

(ii) For arbitrary h,eH;,,— 4,-4)(OX,H,) one can find 


(so—m;—4) (OX, G)) 
such that C,QS g—h,eC®, k=1,..., K. 


g,eH 


We leave the proof for the reader and refer to Theorem 26.4.2 for the 
relations between solvability mod C® and solvability with finite dimensional 
cokernel. 


Notes 


The definition of elliptic boundary problems was probably first stated by 
Lopatinski [1], but a large class of boundary problems for second order 
equations had been studied earlier by ViSik [1]. Elliptic boundary problems 
were discussed in great detail during the 1950’s by many authors such as 
Agmon-Douglis-Nirenberg [1], Agmon [1], Browder [1], Peetre [1, 3], 
Schechter [1]. A thorough treatment was given in the predecessor of this 
book. It was based on methods of the type used here in Sections17.1 and 
17.3, and so is the exposition by Lions and Magenes [1]. 

Calderén [3] outlined another approach to boundary problems using 
singular integral operator theory. It was further developed by Seeley [1]. 
Similar ideas to handle also non-elliptic boundary problems for overde- 
termined elliptic systems by means of pseudo-differential operators were 
presented in Hérmander [17]. Section20.1 follows that paper to a large 
extent. The discussion of the parametrix for the Calderon projector and the 
boundary operator given in Proposition 20.1.5 is closer to Grubb [1] 
though. In the proof of Theorem 20.1.8’ we have also used a case of the 
symbol classes studied in Grubb [2]. 

The index problem for elliptic boundary problems was solved by Atiyah 
and Bott [1]. By a sequence of homotopies of the principal symbol, using 
the given elliptic boundary operators, they defined a virtual vector bundle 
on the boundary of the unit ball bundle in T*(X) and outlined a proof that 
the index of the elliptic boundary problem is equal to the topological index 
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of this extension. Thus they gave a reduction to the index problem on a 
manifold without boundary which was already solved. In Section 20.3 we 
have followed the arguments of Atiyah and Bott [1] closely in principle. 
(See also Luke [1].) However, they have been modified so that each step has 
a natural analytical interpretation. It is interesting to compare with the 
topologically motivated discussion of Atiyah and Bott, so we list an approx- 
imate correspondence between their five homotopies and the presentation 
here: 

First homotopy: Change of degree in Proposition 20.3.4 using Proposi- 
tion 20.3.1. (The latter comes mainly from Boutet de Monvel [2] though.) 

Second homotopy: Proposition 20.2.2 and its application at the end of 
Section 20.3. 

Third and fourth homotopies: Proposition 20.2.1 and its application in 
Step I of the proof of Proposition 20.3.3. 

Fifth homotopy: Step II of the proof of Proposition 20.3.3. 

Already Vi8ik and Eskin [1-5] established Fredholm properties of bound- 
ary problems for elliptic singular integral operators. (See also Eskin [1].) 
Boutet de Monvel [1] introduced the transmission condition, and in [2] he 
developed a theory of elliptic boundary problems for such operators, includ- 
ing a determination of the index. More exactly, he considered operators of 
the form 


A K 
a) C™(X, E)® C~(dX, G) > C™(X, F)@ C*(0X, HB) 
where Q is a pseudo-differential operator in 0X, T is a “trace operator”, K 
is a Poisson operator, that is, the adjoint of a trace operator, and A is 
essentially the sum of a pseudo-differential operator in X° and the product 


of a Poisson and a trace operator. (Here we have only considered a pair 
A 
() The more general setup has the advantage of a better multiplicative 


structure and a better symmetry under passage to adjoints. However, it 
would have taken us too far to develop this approach completely. A very 
detailed exposition has recently been given by Rempel and Schulze [1]. 

The study of non-elliptic boundary problems has largely been motivated 
by their occurrence in the theory of functions of several complex variables. 
(See Kohn-Nirenberg [2].) The study of boundary regularity for the 0 
Neumann problem has been brought to great perfection but is beyond the 
scope of this book. 
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Summary 


We have seen in Sections 64 and 18.1 that the principal symbol of a 
(pseudo-)differential operator in a C® manifold X is invariantly defined on 
T*(X)\0, the cotangent bundle with the zero section removed. When 
discussing the integration theory for a first order differential equation 
p(x,0@/dx)=0, for example the characteristic equation of a (pseudo-) 
differential operator, we also saw in Section 6.4 the importance of certain 
geometrical constructions in T*(X) related to the canonical one form a, 
given by ))é,dx, in local coordinates, and the symplectic form o=dw 
=)dé;dx,;. The integration theory was in fact reduced to the study of 
manifolds <T*(X) of dimension dim X where the restriction of o is equal 
to 0. Maslov [1] has introduced the term Lagrangian for such manifolds. 
The conormal bundle of a submanifold of X is always Lagrangian. The 
extension of the theory of conormal distributions which we shall give in 
Chapter XXV will require a good description of arbitrary Lagrangian 
manifolds. We shall also need to know the structure of submanifolds of 
T*(X) where o has a fixed rank or where the rank varies in a simple way. 
To develop such a study it is useful to start from an intrinsic point of view. 

By a symplectic manifold S one means a manifold with a given non- 
degenerate two form o such that do=0. A classical theorem of Darboux, 
which we prove in Section 21.1, states that S is locally isomorphic to an 
open set in 7*(IR"). If a multiplication (free action) by positive reals is 
defined in S and o is homogeneous of degree 1 with respect to it, then S is 
locally isomorphic to an open conic subset of T*(IR")\0 with multiplication 
by reals in S corresponding to multiplication in the fibers. The one form w 
is invariantly defined in S and defines in S/IR, a contact structure. We shall 
not use this terminology here but rather talk about conic symplectic mani- 
folds which is usually closer to the applications we have in mind. 

A diffeomorphism preserving the symplectic form is called symplectic or 
canonical. In Section 21.2 we give a local symplectic classification of (homo- 
geneous) Lagrangian manifolds as weil as two other classes of submanifolds 
of a symplectic manifold, called involutive and isotropic, which can be 
thought of as larger resp. smaller than a Lagrangian manifold. A classifi- 
cation of pairs of Lagrangian manifolds is also given. 
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Section 21.3 is devoted to classification of certain functions under sym- 
plectic transformations and multiplication by non-vanishing functions. This 
is useful because we shall see in Chapter XXV that these changes of the 
principal symbol of a pseudo-differential operator can be achieved by means 
of a suitable conjugation and multiplication of the operator. Such argu- 
ments will be essential in Chapter XXVI when we study singularities of 
solutions. 

The symplectic classification of geometrical objects is resumed in Sec- 
tion 21.4 with the study of symplectic transformations, or rather relations, 
having fold type of singularities so that they are in general 2 to 2 cor- 
respondences. We then continue with the study of intersecting transversal 
hypersurfaces when the restriction of the symplectic form to the intersection 
vanishes simply on a submanifold of codimension 1. The result, called 
equivalence of glancing hypersurfaces, is essential for a deeper understand- 
ing of mixed problems. However, in this book we shall never rely on any 
of the results proved in Section 21.4. 

Already in Section 21.1 we present a small amount of symplectic linear 
algebra. The topic is pursued in Section 21.6 with a study of the Lagrangian 
Grassmannian, the space of Lagrangian planes in a symplectic vector space. 
The main purpose is the study of the density and Maslov bundles which 
will play a fundamental role in the global machinery to be established in 
Chapter XXV. Another aspect of symplectic linear algebra is discussed in 
Section 21.5 where we study the symplectic classification of quadratic forms. 
This is mainly a preparation for the study of hypoelliptic operators in 
Chapter XXII. 


21.1. The Basic Structure 


In the vector space T*(IR")={(x,é); x,éeIR"} the symplectic form o 
=) dé, dx, is the bilinear form 


G((x; 6), (x, CN = <x, > — <x, oD 
(see Section 6.4). If e; and ¢, are the unit vectors along the x; and ¢, axes 
respectively, then we have for j, k=1,...,n 
(21.1.1) 5(€;,e)J=o(E;,&)=0; o(€;,e,)= —o(@,,£) =O), 


where 6,, is the Kronecker delta, equal to 1 when j=k and 0 when j+k. We 
shall now put this situation in an abstract setting: 


Definition 21.1.1. A vector space S over R (or €) with a non-degenerate 
antisymmetric bilinear form o is called a real (complex) symplectic vector 
space. If S,, S, are symplectic vector spaces with symplectic forms o,,¢, 
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and T: S, S, is a linear bijection with T*o,=<,, that is, 
O(,7)=o,(Ty Ty); yY'ES; 


then T is called a symplectic isomorphism. 


That o is non-degenerate means of course that 
o(y,y)=OVy'eS => y=0. 


We shall now show that every symplectic vector space is symplectically 
isomorphic to T*(IR") (resp. T*(€"); from now on we consider only the real 
case but the complex case is parallel). 


Proposition 21.1.2. Every finite dimensional symplectic vector space S is of 
even dimension 2n and admits a linear symplectic isomorphism T: S > T*(IR"). 


Proof. We just have to show that S has a symplectic basis, that is, a basis 
satisfying (21.1.1). Choose any ¢, +0 in S. Since o is non-degenerate we can 
then choose e,e€S with o(¢,,e,)=1. Let S, be the space spanned by e, and 
&,; it is two dimensional since o(¢,,¢,)=0. Then 


So = {yeS; o(y,S,)=0} 


is of codimension 2 in S and supplementary to S,. In fact, if yeS, 7S, then 
y=x,e, +, e, and 


O=o(y, £\)=—X,, O=a(y,e,)=¢,.- 


S, is a symplectic vector space, for if yeSy and o(y,S )=0 then a(y,S)=0 
because o(y,S,)=0 by the definition of Sy. Assuming that the proposition is 
already proved for symplectic vector spaces of dimension <dimS we can 
choose a symplectic basis e,,&5,...,e,,€, for Sy which gives a symplectic 
basis e,,£,,...,e,,€, for S. The proof is complete. 


A partial symplectic basis can always be extended to a full symplectic 
basis: 


Proposition 21.1.3. Let S be a symplectic vector space of dimension 2n and A, 
B two subsets of {1,...,n}. If {ej}ica» {Eide are linearly independent vectors 
in S satisfying (21.1.1) then one can choose {e;}j¢4,{Edxgs SO that a full 
symplectic basis for S is obtained. 


Proof. Assume for example that Je BN A. We choose e =e So that 
o(e,e)=0, jeA; ofe,e)= —6,,, keB, 


which is possible since {e;} {é,},en are linearly independent and o is non- 


degenerate. If 


jeA? 


xet+) x, e+) 6,¢,=0 
A B 
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then the o scalar product with ¢, is 0, hence x=0. This implies x,;=¢,=0 so 
the extension of the system by e, preserves the linear independence. We can 
argue in the same way if A~ B is not empty. Finally, if A= B+ {1,...,n} and 
S, is the space spanned by {e,},.4 and {é;},.4, then we find as in the proof 
of Proposition 21.1.2 that 


So= {yeS; a(y,S,)=0} 


is supplementary to S, and symplectic. Choosing a symplectic basis in S, 
then completes the proof. 


A coordinate system (x, €)=(x,,...,X,. €1,---@,) in S is called symplectic 
if it defines a symplectic map S— T*(R"). There is an extension theorem for 
symplectic coordinates parallel to Proposition 21.1.3. To prove it we observe 
that if L is in the dual space S’ of S, that is, a linear form on S, then there is 
a unique vector H,éS such that 


<t,L>=o(t,H,), teS. 
One calls H, the Hamilton vector of L. The map S’3sL+>+H,éS is an 
isomorphism, and we define a symplectic form in S’ by 
{L, LD} =o(H,,H,)=<H,, LE). 
It is called the Poisson bracket. If S= T*(IR") then 
H,=) 6L/@é ,6/0x;— > OL/dx,6/0é,, 
{L,L} =) (6L/0E,AL/dx ,—OL/6x ,0L/0€ )) 


by an obvious calculation made already in Section 6.4. The coordinates x, € 
in T*(R") satisfy the so called commutation relations 


(21.1.1Y  — {x;,,x,=0, {6,&}=0, {6 x}= —(x pe} =5,. 


Application of Proposition 21.1.3 to S’ now shows that any partial system of 
linearly independent forms on S satisfying (21.1.1)’ can be extended to a full 
coordinate system in § satisfying (21.1.1). But this means precisely that the 
coordinates define a symplectic isomorphism S > T*(R"). 

We shall continue this brief introduction to symplectic linear algebra in 
Section 21.2 but now we pass on to the analogue of the preceding discussion 
for manifolds. 


Definition 21.1.4. A C® manifold § with a non-degenerate closed C*¥ two 
form o is called a symplectic manifold. If S,, S, are symplectic manifolds 
with symplectic forms ¢,,0, and x: S, >S, is a diffeomorphism with y*o, 
=o,, then y is called a sympiectomorphism, or a canonical transformation. 


An example of a symplectic manifold is T*(X) if X is any C® manifold. 
If ¥ is another C® manifold and x: X > Y is a diffeomorphism, then 


T*(X)3(x, €)+> (K(x), "x (x)! GE T*(Y) 


272 XXI. Symplectic Geometry 


is a symplectomorphism. This is just another way of saying that the sym- 
plectic structure in T*(X) is invariantly defined. 

Our next goal is to prove that every symplectic manifold is locally 
symplectomorphic to T*(IR"). However, before doing so we must make 
some important preliminary observations on the symplectic structure. The 
symplectic form makes the tangent space T,(S) of S at any yeS a symplectic 
vector space, so S is of even dimension 2n by Proposition 21.1.2. If fe C*(S), 
k21, then df is a linear form on T,(S) which defines a Hamiltonian vector 
H,(y). Working in local coordinates we see at once that H, is a C*—’ vector 
field on S. It is called the Hamilton vector field of f. The Poisson bracket of 
f, g€C* is again defined by 

{fg}=HygeCt'. 


To express the condition da =0 we need a lemma. 


Lemma 21.1.5. If @ is any C' two form on a C? manifold M and X,Y,Z are 
three C! vector fields on M, then with the notation [X,Y]=XY—YX 
(21.1.2) CX AYAZ, dm =X CY AZ, WY+ YKZ AX, wy 

+ZX A Y,@>—<[X, Y]AZ, o> 


Proof. Both sides are linear in X, Y,Z,«. If X is replaced by fX, feC', then 
both sides are multiplied by f since 


[fXx, YI=S LX, YI-(YP) x, 
(Yf)<ZAX,w>+(Zf)KX A Yw> + (¥f)<X AZ,w) —(Zf)<X a ¥,oy =0. 


A similar remark is true for Y and Z. It is therefore sufficient to prove 
(21.1.2) when M is a neighborhood of 0 in R%,X,Y,Z are constant vector 
fields, and w= f, df, Adf, with f,eC' and f,, f, equal to coordinates. Then 
the left-hand side is the determinant with the rows (X(fj), Y(f;),Z(4)), J 
=1,2,3, and the right-hand side is the expansion by the first row. 


Now we apply (21.1.2) to the symplectic manifold S$ with w=o and X 
=H,, Y=H,, Z=H,, where f,g,heC?(S). We have 
(H, AH, 0>=0(A,, H,)={g,h} =H,h, : 
<(H,,H,]\H,,0>=(H,H,—H,H)h={ fF, {g,h}} —{8,. {fy} 
so the right-hand side of (21.1.2) becomes 
(1—2)({F, {g, h}} + {g, th S33 + th, CF 8}}). 


Hence the fundamental Jacobi identity 
(21.1.3) {fieh}} tie thsh}+th {fet}=0; fg,hec?(s); 
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follows from the hypothesis do =0 and is in fact equivalent to it. Replacing 
the last term by — {{f/g},h} we can also write it in the equivalent form 


(21.1.3Y Hy j= lH pH. 


This argument shows that (21.1.3) is in any case a first order differential 
operator on h and similarly on f or g. If S=T*(IR") we could therefore prove 
(21.1.3) by noting that it only needs to be verified when f, g,h are coordinate 
functions and then it is obviously true since any Poisson bracket of them is 
a constant so that the second Poisson bracket is zero. However, as already 
emphasized, in our general situation the Jacobi identity expresses the hy- 
pothesis do =0. 

We are now ready to prove that a symplectic manifold is locally sym- 
plectomorphic to T*(IR"). In doing so we observe that if @: $,—-S, is a 
symplectomorphism then 


(21.1.4) O*{FB={O*FO% 8};  fgeC?(S.). 


Indeed, if we use ¢ to identify S, and S, this formula just states that the 
Poisson bracket is determined by the symplectic form. Conversely, (21.1.4) 
implies that *o, and o, define the same Poisson bracket, hence that 6*¢, 
=o, so that ¢ is symplectic. It is of course sufficient that (21.1.4) holds for a 
system of coordinates. To define a local symplectomorphism from S$ to 
T*(R") therefore means choosing local coordinates satisfying (21.1.1). 


_ Theorem 21.1.6. Let S be a symplectic manifold, of dimension 2n, A and B two 
subsets of {1,...,n}. If q;, j€A, and p,, keB, are C® functions in a neigh- 
borhood of y y¢€S with linearly independent differentials satisfying the com- 
mutation relations 


(21.1.1)” {q,.4=0, ijeA; {p.p}3=0, i,keB; 
{Pys Gj} =O je keB, jeA, 


then one can find local coordinates x, ¢ at yg satisfying (21.1.1) such that x, 
=4;, J€A; C,=D,, keB. 


In particular, if we start from the empty sets A and B the theorem shows 
that a neighborhood of 7, in S is symplectomorphic to an open set in 
T*(R"). This is known as the Darboux theorem. 


Proof. Set x,;=q;, j€A; &,=P,, keB. If we apply Proposition 21.1.3 to the 
cotangent space at y, it follows that we can supplement these functions to a 
local coordinate system satisfying (21.1.1)’ at y,. Without restriction we may 
therefore assume that S<IR?" is a neighborhood of y,=0, that q;=x,; when 
jeA, and that p,=&, when keB; moreover (21.1.1) is valid at 0 which 
implies that H, = —0/0¢, and H,,=0/0x, at 0. Assume for example that 
J¢A. Then we can find g;=q with dq=dx, at 0 satisfying the differential 
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equations 
(21.1.5) H,,q={4;q}=0, jeA; H,.g={P..g} =, keB. 


Indeed, by the Jacobi identity (21.1.3)! and (21.1.1)” the vector fields H oe 
H,, all commute, for the Poisson brackets {q;,p,},..., are constant. Hence 
it follows from the Frobenius theorem (see Appendix C, Corollary C.1.2) 
that (21.1.5) has a unique C® solution q near 0 with 


q(x, ¢)=x,; when €,=0, jeA, x,=0, keB, 


for we know that the Hamilton vectors B, H,, at 0 are linearly inde- 
pendent and span a supplementary plane. At 0 we have dg=dx, since 
(21.1.1)' was already fulfilled there. Hence we can replace A by AU {J} with 
q;=q. Continuing in this way we obtain a complete set of symplectic 
coordinates. 


Remark, The important role played by the Jacobi identity in the proof is 
very natural for we know that the theorem would be false if do were not 0. 
Thus the full strength of the Jacobi identity is required. 


With symplectic local coordinates we have c=) dé j4x;, thus 
o"=n!dt,adx,a... Adé, adx, 


since two forms commute. Hence o”/n! is a volume form which can be used 
to orient S and also identify the a-density bundle Q° on S with the trivial 
bundle. In fact, if wy: S,->S, is symplectic, then p*o}/n!=o%/n! so the. 
Jacobian of a symplectic change of variables is one. We can thus identify 
sections of Q* with the functions defining them when symplectic coordinates 
are used. 

Let us also note that the non-degeneracy of o at yeS is equivalent to o” 
+0 at y, if dimS=2n. In fact, the proof of Proposition 21.1.2 shows that we 
can always find local coordinates x, € such that 


k 
Om LAC Adx, at y; 


the construction just breaks off when we have chosen the beginning of a 
symplectic basis e,,€,,-...,@,,€, and o is equal to 0 in the plane orthogonal 
to them with respect to o. This is true in an odd dimensional space as well. 
At y we therefore have of =k!d&, Adx, A... Adé, Adx,+0 while o**+1=0. 
Thus the rank of the form is 2k if k is the largest integer with o* +0 at y. 

In Section 21.4 we shall also encounter degenerate symplectic structures 
for which an analogue of the Darboux theorem is valid. They will be of the 
form discussed in the following theorem, which will not be needed though 
since simplifying additional structure will be present. 


Theorem 21.1.7. Let S be a C® manifold of dimension 2n, let o be a closed 
two form on S, and assume that o"°=mq@ where w is a nowhere vanishing 2n 
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form and m=0, dm+0 at a point y oeS. Assume also that the restriction of 
o"~' to the surface Sy=m—'*(0) does not vanish at yo. Then we can choose 
local coordinates x, € in a neighborhood of yo such that 


o=€,dé, pdx, +2 de; ndx;. 


Thus we would have the usual symplectic form if we could take €?/2 as 
a local coordinate instead of €,. This is of course not permissible since it 
would identify points with opposite values for €, and change the differenti- 
able structure where ¢, =0. 


Proof. We make a preliminary choice of local coordinates such that y, has 
coordinates 0 and m=€,. If i is the inclusion of the surface S, where ¢,=0 
then o, =i*o has rank 2n—2 by hypothesis, so the radical 


{teT,(So); ,(t,t')=O0V'ET,(So)} 


is a line in T,(Sq) for every yeSy near 0. Locally we can therefore choose a 
C® vector field +0 in the radical. Choosing new coordinates so that the 
integral curves of the vector field become parallel to the x, axis, we have 


a= > ajydxjAdxy+ Y rade ndiit Yi eudx; nde. 


1<j<k <j< 


Since do, =0 the coefficients must all be independent of x, for no cancel- 
lation of terms in do, involving dx, can occur. By the Darboux theorem we 
can therefore choose new coordinates x,,€,,...,X,,€, So that 


g,=) dé; ndx;. 
2 
It follows that 


a=dt,aw+>dé,rdx,; when €,=0, 
2 


W=Dajlx dx, + blow eae, 
Here we have written ¢’=(¢,,...,¢,). Thus 
o=SdlE,+ajE,)rdlx)—byE)+a, de, Ad, when &,=0. 
We replace the coordinates ¢, and x, by ¢;+a,;¢, and x,—b;¢, when j>1 
and note that a, =0 since o*=0 when €, =0. 


The form o has now been reduced to 


o=€, 04S dé, ndxp 6=0,+dé, A¢+O(E,) 
2 
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where 6, does not involve dé, or €,. We have 


b= YA (x, E) dx, +) Bix, 6) dg. 
i 2 
Hence 
a=) d(E +A; 65/2) d(x ,—B,E7/2)+ € (A, dé, Adx, +09) + O(€}). 
2 
Taking ¢;+A aa? and x,;—B; é?/2 as new variables we obtain 
ao=6,(A, dé, Adx, +6) + O(E{)+ dé, ndx,. 
2 
Since da =0 we have when €, =0 
dé, A9,=9, 
hence 8, =0 since 6, does not involve €, or d€,. Now 
o"=n!A,€,dé,Adx, a... dé, Adx, + O(E7) 


so A, #0 when €, =0 by hypothesis. Changing the sign of x, if necessary we 
may assume A,>0. If £, A} is then taken as new variable instead of €, we 
have with a smooth two form o, 


G=Oy+E5G,, Gy=E,db, Adx, +) dé, adx,. 
2 


Repeating the argument once more, which we leave for the reader to do, we 
obtain with new coordinates o=o,+€3 0, where , is also smooth. 

If feC® the Hamilton vector field H, with respect to o is defined when 
€,+0. With respect to o) we get the Hamilton vector field 


H9=E; MOf/BE, O/Ax, —Of/Ax, a/08,)+ ¥.(Of/0%,a/0x,—df]0x, /0E,) 
as is obvious if €?/2 is used as a new coordinate. We can write 
€,63(t,H,)=o(t, TH,) 
where T is a linear transformation in R?" with smooth coefficients. Hence 
oolt, I + €{T) Hy) = o(t, Hy) = (t, df) = oo(t, Hp) 


which proves that H,=(I1+¢{T)~'H}. Thus H,-H?=€,R, where R, is 
smooth. 

Following the proof of Theorem 21.1.6 closely we shall now choose local 
coordinate functions q,,...,4,, P1,--->D, such that 


o=p,dp, Adq,+dp,Adq,+...+dp,Adq,; 
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that is, so that we have the commutation relations 
Py; Hy, Pj=Pi H,,9;=0, j>1; 
A, Pe=Ay,4.=9, Hy, d=6%, J, k>1; 
Pi Ag =1, A,,9;=H,,4,:=0, j>l. 


We set p,=¢, and then choose p,,...,p,, 425---59, im arbitrary order, 
leaving the choice of q, to the end. We claim that this can be done uniquely 
with boundary conditions posed exactly as in the proof of Theorem 21.1.6, 
and that p;=¢,;, q;=x; when ¢,=0 for all j. Suppose that this is already 
shown until a certain step in the construction which is still mcomplete. 
Then we have to find the next coordinate satisfying first of all the differen- 
tial equation 


p, H,,u=0du/dx,+€}R,,u=0. 


This equation is also satisfied by the other coordinate functions f already 
chosen so 47 f/éx,6€,=0 when €, =0. Hence 


f=fotDihitvihe 


where f, and f, are independent of x, and €,, and 


Ay=H,,.+pilAs,+Ps Ay )+ hi +27 fy) Ap,- 
The equation H ,u=c may now be replaced by the equation 
(Hy, +p(H,, +p, H,,))u=c 


which means Hf.u=c when €,=0. This will not affect the Frobenius 
condition when ¢€,+0 so it will be satisfied also when €,=0 because of 
linear independence. Hence the construction of u is done as before, and 
since our old coordinates satisfy the required equations when €,=0 we 
maintain the conditions p,=¢,, q;=x,; when €,=0 every time we choose a 
new coordinate function. Thus we can carry the construction on until it 
gives a system of local coordinates. 


The definition of a symplectic manifold was motivated by the properties 
of a cotangent bundle T*(X) but omitted a great deal of its structure: 
T*(X) is also a vector bundle and the symplectic form vanishes in the 
fibers. We have even o=dq@ where is a one form which also vanishes in 
the fibers. We can recover @ from o by noting that 


o(p,t)= a(t) 


if tf is a tangent vector field and p is the radial vector field which in local 
coordinates is given by p=)'€,0/06,. It is invariantly described by 


d 
ef=aMiflent 


278 XXI. Symplectic Geometry 


where M,(x,¢)=(x,t ) is multiplication by t in the fibers of T*(X). Note 
that M*o=to. Although we shall continue to ignore much of the structure 
of T*(X) this multiplication will play an important role in our applications 
so we introduce the following terminology. 


Definition 21.1.8. A C® manifold S of dimension N is called conic if 

{i) there is given a C*® map M: R*xS-—S; we write M,(y)=M(t,y); 
teR*, yeS, 

(ii) for every yeS there is an open neighborhood V with M(UR* x V)=V 
and a diffeomorphism ¢: V-I, where F is an open cone in R*~0, such 
that 

to=oM,, t>0. 
By a conic symplectic manifold we shall mean a conic manifold which is 
symplectic and for which M* o=to where o is the symplectic form. 


Note that the definition implies M,M,=M,, since this is true in R%~0. 
What we have assumed is therefore a free group action of R,. An example 
of a conic symplectic manifold is T*(X)~0, the cotangent bundle of X with 
the zero section removed. The required maps into R?"\0 are given by 
(x, €)r+(x|é], €) in terms of the usual local coordinates. We shall prove that 
for every conic symplectic manifold there is a local symplectomorphism 4, 
homogeneous of degree 1, on an open conic subset of T*(IR")~0; the 
homogeneity means of course that @ commutes with multiplication by 
positive reals as in part (ii) of Definition 21.1.8. However, we must first make 
some remarks on the definition. 

First we note that in a conic manifold a radial vector field p is defined 
by 

pf =SMPf has; 


with the notation in Definition21.1.8 we have $,p=) y,0/dy, for if 
FeC}(R%) then 


d 


d 
qe FO) a1 = 7 FEW 1 = Lj OF /0y;- 


If a function f is homogeneous of degree yp, that is, M* f=t"f, then differen- 
tiation gives pf =f (Euler’s homogeneity relation). Conversely a solution 
of this equation in a suitable neighborhood V of a point in S can be 
extended uniquely to a function homogeneous of degree y in the conic 
neighborhood generated by V. This is perfectly obvious when ScIR* and V 
is convex. 

Now assume that S is a conic symplectic manifold. The condition M*o 
=to means that 


(21.1.6) tM*{fu}={(M*fM*u}; f, ueC(S). 
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To prove this we choose a positively homogeneous positive function T of degree 
1 and set 0; = o/T. Then M,*o; = to/tT = 01, so for the Poisson bracket with 
respect to 0, we have (21.1.4). Thus 


Mi (T{fus)=T{ MES, MF u} 
which gives (21.1.6). Differentiation of (21.1.6) gives when t=1 


(21.1.6) {fujt+p{fu}={efuy +h pu}, 
that is, 

(21.1.6)” A,+[p.H,J=H, 5. 

In particular, if f is homogeneous of degree » then 
(21.1.6)” [H,,p]=(1—y)HA,. 


We shall now modify Theorem 21.1.6, taking into account that the stand- 
ard local coordinates x, € in the cotangent bundle are homogeneous of 
degree 0 and 1 respectively. 


Theorem 21.1.9. Let S be a conic symplectic manifold of dimension 2n, A and 
B two subsets of {1,...,n}, and q;, j€A; p,, keB; C™ functions in a conic 
neighborhood of yéS such that 

(i) q, and p, are homogeneous of degree 0 and 1 respectively, 

(ii) the commutation relations (21.1.1)” are valid, 

(iii) the Hamilton’ fields H ap pn with je¢ A, keB, and the radial vector 

field p are linearly independent at . 

Let a,, b, be arbitrary real numbers such that 


(iv) qjy)=4,;, JeA; p,(y)=b,, keB, 


and assume that b,+0 for some J¢éA. In a conic neighborhood of y one can 
then find C® functions q,, j¢A, and p,, k¢B, so that (i), (ii), (iv) remain valid 
for indices running from 1 to n and (iii) remains valid if j+J. In particular, 
(q, p) defines a homogeneous symplectomorphism of a conic neighborhood of y 
in S on a conic neighborhood of (a, b)eT*(R")~ 0. 


Note that starting from the empty sets A and B we conclude in particu- 
lar that every conic symplectic manifold is locally isomorphic to T*(IR”)\0. 
The prescription of the values of q,(y) is not of real interest since one can 
add constants to q, without affecting (i), (ii), (iii). From (i) it follows that 


o(p, H,,)=0, o(p, H,,)=Dy- 


Since o(H,,,H,,)=0 when i,jeA the symplectic form vanishes in the plane 
spanned by H ay JEA, and p, which must therefore be of dimension <n in 
view of Proposition 21.1.3. Hence (iii) implies that A has at most n—1 
elements. For the extended system we must have 


p=) cH, 
1 
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where —c,=<p,dp;> =pj;. If (iii) is valid it follows that the conclusion in the 
theorem is false unless b,-0 for some j¢ A. 


Proof of Theorem 21.1.9. It suffices to construct q;, p; in a neighborhood of 
satisfying the required conditions with (i) replaced by , 


(i) 24;=9,  PPy=Pis 


for q;, p, can then be extended to homogeneous functions in a conic 
neighborhood of ». By hypothesis the Hamilton vectors ¢;= —H,,(y), & 
=H,,(y) in the symplectic vector space W = T,(S) satisfy the conditions 


(a), (iv) a(p,é)=0, jeA; a(p,e)=b,, keB; 

(ii) a(é,,€)=0, i,jeA; a(e;,e,)=0, i,keB; 
o(é;,€,)=6,, Jed, keB; 

(iti) &;, j€A; e@, keB; and p are linearly independent. 


The first step in the proof is to show that one can choose £;, jfAU {J} and 
é,, K€B so that these conditions are preserved. This is just a problem in 
linear algebra, which we solve by looking at a number of cases: 

a) If ke ANB we choose e=e, satisfying the equations 


o(p,e)=b,; o(e;,e)=0, ieB; a(é;,e)=5,,, jeA. 


This is possible since o is non-degenerate and (iii) is fulfilled. Since o(¢,, e) 
+0 but ¢, is o orthogonal to p and to ¢,, je A, and to e,, Je B, the linear 
independence is preserved when we add e, to our system. 

b) If keANB then W=W,@W, where W, is spanned by e, and ¢, and 
Wo is the symplectically orthogonal space of e; and ¢,. Correspondingly we 
split p=p)+p, and obtain the same conditions for p, and the remaining 
vectors ¢,, e; in Wo. The statement then follows from a lower dimensional 
case. 

c) If A is empty and k¢B we have to choose e=e, so that 


a(p,e)=b,, a(e,,e)=0, ieB. 


The solutions form an affine space of dimension 2n—|B|—1 if |B] is the 
number of elements in B. It must contain elements outside the span of p 
and e,, ieB, unless |B|=n—1 and the equations are satisfied by all elements 
in this plane. This implies 


0=b,=90(p,e)=5, 


for all i¢B, which contradicts the hypothesis that some b,+0, which was 
preserved in the reduction made in case b). 

d) If B={1,...,n} and A is empty, n>1, we choose e=e, for some j+J 
so that 


a(p,e)=0, o(€,,e2)=—6,, kK=1,...,n. 
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E=Cpt+) Cy, e. 
1 
Then 
6, =O (8, e,)=C o(p,e,)=c by. 


Taking k=j we find that c+0 but this is a contradiction when k=J since b, 
+0. Hence the linear independence is automatically preserved. 

A combination of the preceding steps completes the construction of the 
vectors ¢,, j+J, and e,, and we then choose ¢, so that (i)’, (iv), (ii) remain 
valid. It remains to modify this infinitesimal solution to a construction of 
actual coordinate functions. To do so we start by choosing local symplectic 
coordinates at y so that x;=q,;, jéA, ¢,=p,, keB, and H,(y)=—e,, Hz,(y) 
=e, for all j, k. Then we have 


p=%. 60/00 ;+ 01; P1=>.F;0/0E;+ > 8, 0/0x, 


where the coefficients of p, are independent of ¢,, j¢ A, and x,, keB, because 
(21.1.6) gives 


[4/0¢,, o,]= —LH,,,p]—0/0&,= —H,, + H,,=0, 
[6/0x,, Pil — [es Pp] =0. 


To choose a new coordinate function, say u=p,, K¢B, means to find a 
solution of the equations 


OujOe;=bjx, jeA; Ou/Ox,=0, keB; pu=u. 


If we set u=¢,+v this means that v is independent of ¢; and x, and that 
P20—-0=Ex— )) Ox; 
jeA 


where p, is obtained from p, by dropping 0/0¢,, je A, and 0/0x,, keB. For v 
we therefore have a differential equation in the remaining variables, and p, 
+0 by (ili). Hence the equation can be solved with prescribed initial data 
on a surface transversal to p, through y in the plane of the variables ¢,, 
J€A, x,, kEB. We can then give du any value compatible with the equations 
at y, in particular take du=d€, at y. The construction of a coordinate 
function q,;, j£A VU {J} is exactly parallel. Finally, when all p, and all q,; with 
j+J are known, we obtain q, by solving an ordinary differential equation in 
€,. This completes the proof. 


With the coordinates constructed in Theorem 21.1.9 we have a homo- 
geneous local symplectomorphism to T*(IR"). There p is the Hamilton field 
of the one form w=)’ €,dx, which is therefore invariantly defined on any 
conic symplectic manifold. In fact, w(t)=a(p,t) if t is a tangent vector. We 
have dw=o since this is true in T*(R")~.0, so the symplectic form is always 
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exact on a conic symplectic manifold. The condition (iii) in Theorem 21.1.9 
is often stated in the equivalent form that dq,, dp, and @ are linearly 
independent at y if je A, keB. 


There is also a homogeneous version of Theorem 21.1.7 which we give 
for the sake of completeness; it will never be used here. 


Theorem 21.1.10. Let S be a conic manifold of dimension 2n with a two form o 
satisfying the hypothesis in Theorem21.1.7 such that o is homogeneous of 
degree 1 and the radial vector p(y) is not in the radical of o restricted to So. 
In a conic neighborhood of yo one can then choose coordinates €, homo- 
geneous of degree 3, €>,...,€, homogeneous of degree 1 and x, homogeneous 
of degree 0 such that the coordinates of ),. are x =0, €=6, and 


o=€,dé, Rax Da Nex 


Proof. Very little has to be changed in the proof of Theorem 21.1.7 so we 
shall only review it briefly. The surface S, defined by m=0 is obviously 
conic. By hypothesis the radical of the two form a; in So is not generated by 
the radial vector field, so we can take a conic hypersurface S; C So through yo 
which is transversal to the radical. The restriction of o, to S, is then non- 
degenerate and makes S, a conic symplectic manifold. By Theorem 21.1.9 we 
can therefore choose homogeneous symplectic coordinates x,,...,x,, 
€,,...,€, in S, with €,=1 and the others 0 at y). We extend these functions 
to a conic neighborhood of S, in S, so that they are constant along the 
integral curves of the radical, which preserves the homogeneity properties, 
and we choose x, homogeneous of degree 0 and 0 at 7, with restriction to 
Sg vanishing simply on S,. Finally we choose €, homogeneous of degree 4 
vanishing simply on S,. The restriction of o to Sy is then equal to 


Ydd, nds 


Since the successive reductions made in the proof of Theorem 21.1.7 are all 
unique it is quite obvious that they preserve the homogeneities. The verifi- 
cation of this is left for the reader. 


Example 21.1.11. The situation in Theorem 21.1.10 occurs if in T*(IR"**) we 
form the intersection of the surfaces defined by x, =0 and 


CTX eet ona oe 


The intersection is transversal when ¢,>0, and the restriction of the sym- 
plectic form to the surface is 


o=dt,Adx,+...+d&,rdx,+d(E/E,) rdx,,1- 
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Taking x41, X2.-+:X_» €1(2/€,)%, €,...,€, a8 variables on the surface we 
have the situation described in Theorem 21.1.10. An equivalent formulation 
of Theorem 21.1.10 is therefore that a reduction to this example is always 
possible. We shali return to the example in Section 21.4. 


21.2. Submanifolds of a Symplectic Manifold 


As in Section21.1 we begin with a brief discussion of linear algebra in a 
symplectic vector space S, with symplectic form a. Just as in a Euclidean 
space a linear subspace V of S has an annihilator V’ cS with respect to the 
symplectic form, 
V? = {yeS; o(y, V)=0}, 

and (V’)"=V. However, in general V is not a supplementary space of V’. 
Since dim V+dim V’=dimS this is true if and only if VAV’ = {0}, that is, 
V is symplectic in the sense that the restriction to V of the symplectic form 
is non-degenerate. In that case V’ is of course also symplectic so S is the 
direct sum of the symplectic subspaces V and V’. This observation was used 
already in the proof of Proposition 21.1.2. In general there is a substitute for 
this construction: 


Proposition 21.2.1. If V is a linear subspace of the symplectic vector space S 
then 


(21.2.1) S=V+tVIYVOV’) 

is a symplectic vector space, and 

(21.2.2) dim S’=dim § —2 dim(Vn V’)=2 dim(V + V")—dimS. 

Proof. The o orthogonal space of W=V+V’ is F=VE (VY 


=VOAV'CW. If weW then a(w, w)=0 for all w'e W if and only if weW’, so 
o induces a non-degenerate antisymmetric bilinear form o’ on S’. Since 


dimW+dimW’=dimS, dimW—dimW’=dimS’,, 
we obtain (21.2.2). 


In particular, we have dimS’=dimS—2dimV if V’>V and dimS’ 
=2dimV—dimS if V?cV. These cases occur so frequently that they have 
special names: 


Definition 21.2.2. A linear subspace V of a symplectic vector space is called 
isotropic, Lagrangian resp. involutive (or coisotropic) if VcV’, V=V’ and 
V>V’ respectively. 
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The dimension of V is thus <dim$/2, =dim $/2, and =dimS/2 in these 
three cases. An isotropic (involutive) subspace V is Lagrangian if dimS 
=2dim V. 


Proposition 21.2.3. V is isotropic (involutive) if and only if V is contained in 
(contains) a Lagrangian subspace. 


Proof. Assume first that V is isotropic and choose a basis e,,...,e, for V. 
Then o(e,,e))=9, i,j=1,...,k so by Proposition 21.1.3 we can extend this to 
a symplectic basis e,,...,@,, €,,...,8, for S. Then e,,...,e, span a Lag- 


rangian subspace containing V. If V is involutive then V®% is isotropic, so 
V’ cA for some Lagrangian plane, hence Ac V. 


S’ always splits in a natural way into the direct sum of two orthogonal 
symplectic subspaces V/(VAV’) and V°(VoOV’). The symplectic form of S 
vanishes on VO V’. Now start by choosing a basis e,,...,e, for VOV" as in 
the proof of Proposition 21.2.3 and extend it by elements e,,,,...,€¢4), 
&e41>-+-2€%4) Whose residue classes form a symplectic basis for V(VOV’). 
Next add elements e)45445--::€n> &es14y9++0@, Tepresenting a symplectic 
basis in V’?/(VOV’). We have then obtained a basis for V+V*° satisfying 
(21.1.1). Since dim S=dim(V + V7)+dim(VnN V"’)=2n we can now use Prop- 
osition 21.1.3 to add elements ¢,,...,¢, to get a symplectic basis for S. With 
corresponding symplectic coordinates V is then defined by 


Cp HOO, Xe Xn Seg = HO =O. 


The dimension is k+2] while the dimension of S’ is 2(n—k) so it is not in 
general determined by the dimension of V. 
The preceding discussion can be extended to manifolds: 


Theorem 21.2.4. Let S be a conic symplectic manifold, V a conic submanifold 
such that the restriction of the symplectic form to V has constant rank 21 in a 
neighborhood of a point y where the canonical one form w does not vanish 
identically on T,(V). Then there are homogeneous symplectic coordinates x, ¢ 
in a conic neighborhood of y in S such that V is defined by 


(21.2.3) €,=...=€,=0, app, =e =X, = Seer = =F, =0 
where k+21=dim V. 


Note’that 140 for V cannot be isotropic since the tangential radial 
vector p(y) by hypothesis is not orthogonal to T,(V). There is an obvious 
non-homogeneous version of the theorem where w does not occur so that 
this condition drops out. We shall usually leave such variants for the reader. 


Proof. We can choose f,,..., f,eC™ homogeneous of degree 1 and vanishing 
on V so that df,,...,df, are linearly independent at y and define T,(V). Here 
r=2n—dim V where 2n=dim S. Set k=dim V—21. 
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a) If k+I<n it follows from the discussion of the linear case that some 
Poisson bracket, say {f;,f,}, does not vanish at ». Thus H,, cannot be 
proportional to the radial vector p, since pf,=0. By Theorem 21.1.9 we can 
therefore choose local homogeneous symplectic coordinates such that ¢,=/f, 
and y corresponds to (0,¢,), ¢,=(1,0,...,0), say. Then the equation f,=0 
can be solved for x,, since 


Of,/OX,={hi,foy#O at y. 


In a sufficiently small conic neighborhood of y the equation f,=0 can 
therefore be written in the form 


X= B(%15E 45-005 6n 15 on) 


where g is homogeneous of degree 0. If X,=x,—2(%1,61,---56,-15&,) We 
have €,=0, X,=0 on V, and {&,,X,}=1 in a neighborhood of y. Using 
Theorem 21.1.9 again we can find a new homogeneous symplectic coor- 
dinate system where X, and €, are the last coordinates. Changing notation 
we may therefore assume that x,=¢€,=0 on V. This means that V can be 
considered as a submanifold of T*(IR"~*), and the theorem follows if it has 
already been proved for symplectic manifolds S of lower dimension. 

b) If k+l=n but k+0, that is, V+S, we have {f,,f;}=0 on V for ij 
=1,...,r by the discussion of the linear case, and by hypothesis p is not a 
linear combination of H,,,...,H,,. We choose homogeneous symplectic 
coordinates such that f,=¢, and y has coordinates (0,¢,), say. Then €,=0 
on V and 0f,/0x,={6,,f}=0 on V, j=2,...,r. Hence V is cylindrical in the 
x, direction and defined near y by €,=0, FO, X25 -0+y% yO, Cas oer Sy =O, J 
=2,...,r. The differentials of these functions and w are linearly independent. 
If the theorem is already proved for lower dimensional S we can choose 
new coordinates x3,...,X,,62,...,€, such that Va{x,=€, =0} is defined by 
é€,=...=€,=0, and this completes the proof. 


In part b) of the proof we found that V was generated by a family of 
curves. These are manifestations of an important foliation which we shall 
now discuss. (See Appendix C.1 for terminology.) 


Definition 21.2.5. A submanifold V of a symplectic manifold is called sym- 
plectic, isotropic, Lagrangian resp. involutive if at every point of V the 
tangent space of V has this property. 


Remark. Note that the constant rank condition of Theorem 21.2.4 is auto- 
matically fulfilled in these cases. 


Theorem 21.2.6. If V satisfies the hypothesis in Theorem 21.2.4 then the sub- 
space T,(V)\T,(V) of T,(V) defines a foliation with isotropic leaves transver- 
sal to the radial vector field, of dimension dim V — rank ol,.. 
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Proof. In the local coordinates of Theorem 21.2.4 we have the foliation of V 
by the parallels of the x, ... x, plane. 


The reader is advised not to ignore this important result because of the 
brevity of the proof; it really depends on all we have done so far in this 
chapter, and the foliation will be used very frequently. 


As pointed out after the statement of Theorem 21.2.4 there is a non- 
homogeneous version of that result so we have a foliation of any sub- 
manifold V of a symplectic manifold S such that the symplectic form 
restricted to V has constant rank. If S and V are conic and the radial vector 
field p is tangent to a leaf B of the foliation at some point y, then B is conic. 
In fact, the ray through y must stay in the leaf since the homogeneity 
implies that p is in the plane of the foliation along it. The homogeneity also 
implies that M,B is a leaf of the foliation for any t, if M, denotes multiplica- 
tion by t; since it has points on the ray through y in common with B it 
must agree with B. Hence we have the following supplement to Theo- 
rem 21.1.6. 


Theorem 21.2.7. If S is a conic symplectic manifold, V a conic submanifold 
such that the symplectic form restricted to V has constant rank, then 
T,(V) Ty (V) defines a foliation of V with isotropic leaves which are either 
transversal to the radial vector field or else conic. 


When V is involutive the constant rank condition is automatically ful- . 
filled, as already pointed out, and the leaves of the foliation are then 
generated by the integral curves of the Hamilton fields H, of functions 
vanishing on V. Indeed, we have H,(y)eT,(V)=T,(V) 0 Ty (V) when yeV then. 
The foliation is therefore usually called the Hamilton foliation. 

If V is a submanifold of a symplectic manifold S, such that the symplec- 
tic form restricted to V has constant rank, then we have a natural vector 
bundle S, on V with fiber 


(T,.V)+ TV MTV) 9 TV) 


at yeV. The fiber is a symplectic vector space so S, is called a symplectic 
vector bundle. In the local coordinates of Theorem 21.2.4 S,, is the product 
of V and the symplectic vector space T*(IR"~*). It is easy to see that we 
have in fact a vector bundle with these local trivializations. Note that the 
restriction of S, to a leaf B of the foliation of V is equal to the bundle S,, 
so the construction is mainly of interest in the isotropic case where the 
foliation is trivial (the leaves become all of V). For a manifold which is 
neither isotropic nor involutive the symplectic bundle S, is in a non-trivial 
way the direct symplectically orthogonal sum of two symplectic bundles S, 
and Sj, with fibers 


TVET V)OT eV) and TeVI(T(V)A TV). 


21.2. Submanifolds of a Symplectic Manifold 287 


There is also an analogue of Theorem 21.2.4 when the restriction of w to 
V is identically 0. This implies that the restriction of c=dw to V vanishes, 
so V is isotropic. Conversely, if V is isotropic and conic then the restriction 
of w to V is 0, for w(t)=a(p, t)=0 if t is a tangent vector of V, since p is also 
atangent vectorofV. ~ 


Theorem 21.2.8. Let S be a conic symplectic manifold, V a conic isotropic 
submanifold. In a conic neighborhood of any yeV one can then find homo- 
geneous symplectic coordinates such that V is defined by 


(21.2.4) x,=...=x,=0, & ,=...=€,=0 


where k is the dimension of V. 


Proof. We shall first prove that the coordinates can be chosen so that V is 
defined by 


(21.2.5) x,=0, €,=...=&=0, %.,=¢,,=...=x,=¢,=0. 


Part a) of the proof of Theorem 21.2.4 is applicable with no change and 
reduces the proof to the case where dim V=n, that is, V is Lagrangian. If 
n > 1 we can still choose one of the functions Ki vanishing on V so that H, 
is not proportional to p, so the reduction in part b) of the proof ot 
Theorem 21.2.4 is applicable. Finally, if n=1 the fact that the one form 
€,dx, vanishes on V implies that x, is a constant which we choose as 0. 
Since there is a local homogeneous symplectomorphism mapping any conic 
isotropic V of dimension k, and in particular (21.2.4), to the same model 
given by (21.2.5), we can also choose (21.2.4) as our model. 


In particular any conic Lagrangian manifold can be defined locally by 
x=0, that is, as the fiber T{(IR")\ 0 in a suitable system of homogeneous 
symplectic coordinates. Note that if X is any C” manifold and YcX is a 
submanifold, then the conormal bundle N(Y) of Y is Lagrangian since 
<dx,€>=0 on T,(Y) if €EN,(Y). The theorem is therefore applicable in 
particular to any conormal bundle. The following theorem shows that any 
conic Lagrangian submanifold of a cotangent bundle contains large open 
subsets of this type. 


Theorem 21.2.9. Let X be a C®” manifold and V a C®™ conic Lagrangian 
<T*(X)N0. If (x9,E))EV and the restriction to V of the projection n: 
T*(X)—>X has constant rank r in a neighborhood of (X9,o), then there is a 
C® submanifold Y< X of dimension r in a neighborhood of xX. with conormal 
bundle equal to V in a neighborhood of (Xo, €o). 


Note that r<n since 2, =O for the radial vector field which is a 
tangent of V. 
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Proof. By Proposition C.3.3 a neighborhood of (x9, ) in V is mapped by z 
on an r dimensional C® manifold YcX. Since }'é;dx;=0 on V and 2, 
maps the tangent space of V onto that of Y, it follows that € must be a 
normal to Y, that is, VCN*(Y) in a neighborhood of (x, €,). Equality must 
hold since the dimensions are the same. 


The hypothesis in Theorem 21.2.9 is always fulfilled in an open dense 
subset of V where the rank is equal to the maximum in any neighborhood. 
However, in general the projection of V can be quite singular. A simple 
example is the Lagrangian in T*(IR7)\0 defined by 


(21.2.6) Mase we H2G Ve) 6740: 


The projection is the curve (x,/3)?—(x,/2)?=0, and V is the closure of the 
conormal bundle of the regular part. 


We shall now derive a normal form for intersecting conic Lagrangians. 
The intersection is never transversal, for the canonical one form vanishes on 
both tangent planes at the intersection. We must therefore use the notion of 
clean intersection discussed in Appendix C.3. 


Theorem 21.2.10. Let S be a conic symplectic manifold of dimension 2n, V, 
and V, two conic Lagrangian submanifolds intersecting cleanly at yeS. Then 
one can choose homogeneous symplectic coordinates x, € at y such that the 
coordinates of y are (0,€,), €;=(1,0,...,0) and near (0, €,) 


(21.2.7) Vi ={0,9}, v= {0,x", 60} 


where x’ ==(X1,-..,X,), X” =(%,41,---,X,) and k is the dimension of V,V, (or 
equivalently the excess of the intersection ). 


Thus V, is locally the conormal bundle of a point and V, the conormal 
bundle of a manifold of codimension k through it. Note that the conormal 
bundles of manifolds intersecting cleanly must also intersect cleanly since 
they are linear spaces in coordinates where the intersecting manifolds are 
linear spaces. 


Proof. By Theorem 21.2.8 we may assume that S is a conic neighborhood of 
(0,¢,) in T*(R")\0 and that V, is defined by x =0. The intersections V/, Vz 
and VinVs of V,,V,,V,AV, with the sphere bundle S,={(x,é); {€|=1} 
have dimensions decreased by one, and the direct sums of their tangent 
spaces and p are equal to those of V,,V,,V,qV, at S,. Hence Vf and Vj 
intersect cleanly with k—1 dimensional intersection. 

a) Assume first that k=1. Then the intersection is 0 dimensional which 
means that the tangent spaces of Vf and Vj have only 0 in common. Since 
Vf is the sphere defined by x=0 it follows that the differential of 1: Vj— IR” 
is injective, hence of rank n—1. By Theorem 21.2.9 it follows that V, in a 
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neighborhood of (0,¢,) is the conormal bundle of a hypersurface which we 
can take as the surface x, =0 by a change of variables in IR". 

b) Assume now that k>1. Then Vf and Vj are of dimension n—1 and 
have a k—1 dimensional clean intersection in a 2n—1 dimensional ma- 
nifold, so the excess is k by (C.3.3). Hence we can find k=2 functions 
fi>-->f,, homogeneous of degree | which vanish on V, UV, and have linearly 
independent differentials at y. Then H,, and p must be linearly independent 
for some j, say j=1. As in part b) of the proof of Theorem 21.2.4 we can 
choose new homogeneous symplectic coordinates such that f, =¢, and con- 
clude that V, and V, are both locally equal to the product of the x, axis and 
Lagrangians in x,=—¢,=0 intersecting cleanly of dimension k—1. If the 
theorem is already proved for smaller values of k and n we can choose new 
coordinates x,,¢,,...,X,_15%,_, so that V, and V, are defined near (0,¢,) 
by 

Vii xX~=... =X 
Va: xX,=...=xX,_,=0, €,=...=¢,=0, 


which intersect in the plane where all variables except x,,&,,...,€,_, are 0. 
In particular, the pair in (21.2.7) can be presented in this form which 
completes the proof just as in the case of Theorem 21.2.8. 


Theorem 21.2.10 is of course also valid in the non-homogeneous case. 
The linear case is very elementary but instructive: If V, and V, are Lag- 
rangian subspaces of a symplectic vector space S then we can choose a 
symplectic basis ¢,,...,€,,&,---,&, such that V, is spanned by ¢,,...,¢, and 
V, is spanned by @,44,---,@,5&15+++5&- To do so we first choose a basis 
£,,-.-,€, for V, VV, and extend it to a basis ¢,,...,¢, for V,. The symplectic 
form restricted to V,xV, gives a duality between V,/(V; NV.) and 
V/V, V2). Hence we can choose e,,,,...,¢,€V, forming a dual basis to 
&j419+++5€,, and then we just have to extend to a full symplectic basis using 
Proposition 21.1.3. The following consequence will be useful below: 


Corollary 21.2.11. Let S be a symplectic vector space and V,, V, two 
Lagrangian subspaces. Then we can find a Lagrangian subspace V transversal 
to both. 


Proof. lf V, and V, have the form (21.2.7) we can take 
V={(x, ¢); x” =O", C=O}. 
If S, and S, are symplectic manifolds then a symplectomorphism or 
canonical transformation x: S,-S, is a diffeomorphism with 7* o,=o, if a, 


is the symplectic form in S$, (Definition 21.1.4). This implies that S, and S, 
have equal dimensions. The graph of x 


G={(x()), 7); PES2} CS, x Sz 
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is a Lagrangian submanifold of S, x S, with the symplectic form 
O12 =NT 01-150, 


where 2,: S,xS,—>S, and 2,: S,xS,-S, are the two projections. (In 
what follows we shall usually write o, instead of n¥o, when no misunder- 
standing seems possible.) In addition 1, restricted to G is a diffeomorphism 
on S, for j=1,2. By dropping this condition we can extend the notion of 
canonical transformation and in particular drop the condition that S, and 
S, have equal dimensions. 


Definition 21.2.12. Let S,, S, be symplectic manifolds with symplectic forms 
0,,0,. A Lagrangian manifold GcS,xS, with respect to the symplectic 
form o,—@, is then called a canonical relation from S, to S,. If S, are conic 
and G is conic we call the relation homogeneous. 


As in Section 5.2 a relation maps a set ECS, to a set G(E)cS,, 
G(E)={7,€S13(71,72)€G for some y,€E}=2,(G7n;' E). 


Relations can be composed as functions: If G,cS,xS, and G,cS,xS, 
then 


G,oG,={(71,73)3 (71, 72€G, and (y2,73)€G, for some y,}. 


We can also interpret this as the projection on S$, x S, of the intersection of 
G,xG, with S, x A(S,)xS, where A(S,)= {(y, y); yES,} is the diagonal in S, 
xS,. Note that A(S,) is isotropic with respect to the symplectic form 
—1%,0,+73,0, where 7,, is the projection S,xS,-S, on the j factor. 
There is of course no reason in general why G,°G, should be smooth but 
we shall prove that for canonical relations G, and G, we always obtain a 
smooth canonical relation if the intersection is clean and suitable global 
restrictions are made. First we study the construction in the linear case, in a 
somewhat more general context. 


Proposition 21.2.13. Let S be a symplectic vector space with symplectic form o, 
let A be an isotropic subspace, and denote by S' the symplectic vector space 
A’/A. If ACS is any Lagrangian subspace of S then N =(AN AANA) is a 
Lagrangian subspace of S’. 


Proof. That 4’ is isotropic follows from the fact that 4 is isotropic and from 
the definition of the symplectic form in S’. Now 
dim 1+ dim 4° =dim(An A’)+. dim(A+ A’) 
=dim(An A’)+dim S —dim(1n A), 
hence by (21.2.2) 
dim 4’ = dim A—dim A =(dim S’)/2. 
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Remark. Although the map j+-/’ is always defined it is not continuous 
when 47 A’ changes dimension. As an example consider S = T*(R7), 


A={(0,x,,0,0)}, A? ={(x,,x2,€,,0)}. 


Then A= {(ax,,x,,€,, -a€,)} is Lagrangian for every a but 2’= {(x,,0)} ifa 
+0, 2’ = {(0,€,)} if a=0. 

We are now prepared to discuss the composition of general canonical 
relations. 


Theorem 21.2.14. Let S; be a symplectic manifold with symplectic form o,, j 
=1,2,3. If G,cS,xS, and G,<S,xS, are Lagrangians for the symplectic 
forms ¢,-—0, and 6,—06,, and G, xG, intersects S, x A(S,) x S, cleanly in G, 
then the projection x from G to S,xS, has rank (dimS,+dimS,)/2 and the 
range G,°G, of 7 is locally a Lagrangian manifold with respect to 0, —63. 
Under these hypotheses we shall say that the composition is clean. 


Proof. Let (7,,72,¥2.73)€G. By the definition of clean intersection the tan- 
gent space of G there is 


ANT, (81) x A(T,,(82) X T,(Sa)s_— A= Ty, (G1) X Ty, (Ga). 


Yi. V2 


A is a Lagrangian subspace of the symplectic vector space 
T,,(S1)® T,,(S2) ® T,,(S2) ® T,, (S3) 


with the symplectic form ¢, -0,,+0,,—0,. The range of the differential of 
nm is thus obtained by the construction in Proposition 21.2.13 applied to the 
isotropic subspace {0} x A(T,,(S)) x {0} which proves the statement on the 
rank. Hence it follows from Proposition C.3.3 that the range of z is locally a 
Lagrangian submanifold of S, x S3. 


Remark. If Q is an open set in S, x S, and the map m: 2~'(Q)->Q is proper, 
it follows that 2(7—'(2)) is an immersed closed Lagrangian submanifold of 
Q and that 2~'(Q) is a fiber space with compact fibers over it. 

The construction in Theorem 21.2.14 is applicable in particular when S, 
is reduced to a point. Then it produces a Lagrangian in S, from one in S, 
and one in S, xS,. This idea leads to an important method of parametriz- 
ing C® Lagrangian submanifolds A of the cotangent bundle T*(X) of a 
manifold. Recall from Section 6.4 that if A is a section of T*(X) then A can 
locally be represented in the form {(x, ¢’(x))}} where ¢e€C®™ in an open 
subset of X. This representation breaks down if A is not a section. In 
particular it can never be used if A is conic since the radial vector field p is 
then a tangent of A. However, assume that $(x,0) is a C® function in the 
product of an open set in X and another C® manifold 0. Then 


G= {(x, b.(x, 8); 0, — Palx, A} 


is a Lagrangian submanifold of T*(X) x T*(@) (the minus sign is caused by 
the conventions above about the symplectic form in the product of two 
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symplectic manifolds). The zero section Gy in T*(@) is also Lagrangian. If 
T*(X) x A(T*(O)) and Gx G, intersect cleanly, that is, 


{(x, 8); be(x, 8)=0} 


is a submanifold of X x @ with tangent space everywhere defined by the 
equations d@,(x, 6) =0 then 


(21.2.8) {(x, P(x, 9); Polx, 8) = 0} 


is locally a Lagrangian submanifold of T*(X). We recall that this con- 
struction occurred already in Theorem 8.1.9 in its natural analytical context. 
It is clear that (21.2.8) is conic if ¢ is homogeneous of degree 1. 


Definition 21.2.15. If X is a C® manifold and ${x,6) a C® real valued 
function in an open conic set ['< X x(R*~0) which is homogeneous of 
degree 1 in 6, then @ is called a clean phase function if dp+0 and 


C= {(x, eT; bo(x, 0) =0} 


is a C® manifold with tangent plane defined by the equations d¢,=0. The 
number of linearly independent differentials d(0@/06,), j=1,...,N, is equal 
to N—e on C, where e=dimC—dimX is called the excess. The map 
Ca(x, 4) (x, 64) is locally a fibration with fibers of dimension e and a conic 
Lagrangian A as base; A is said to be parametrized by @. If e=0 the phase 
function is said to be non-degenerate. 


Theorem 21.2.16. If X is a C® manifold of dimension n and A<T*(X)~0 is a 
C™ conic Lagrangian submanifold, then A can always be locally parametrized 
by a non-degenerate phase function @ in an open cone in X x(R"~0). If 
YoEA and xq is the projection of yo) in X, the local coordinates x,,...,X, at Xo 
can be chosen so that with the corresponding coordinates X,,..-,Xq, €15-+2.&, 
in T*(X) the Lagrangian plane €=constant through y, is transversal to A. 
Then there is a unique phase function of the form 


$(x,6)=3'x,0,—H(0) 


parametrizing A; HEC”, H is homogeneous of degree 1, and near yo 


A={(H'(4), O}- 


Proof. Choose local coordinates y,,...,y, at Xo such that the coordinates of 
Xo are all 0 and y,=(0,dy,(0)). Using these coordinates we consider A as a 
subset of T*(IR")\0. The tangent plane A, of A at (0,¢,), ¢, =(1,0,...,0) isa 
Lagrangian plane. It may not be transversal to the plane €=¢,, which is the 
graph of the differential of the y, coordinate. However, we know from 
Corollary 21.2.11 that there is a Lagrangian plane through (0,¢,) which is 
transversal both to A, and to the fiber T*(R"). Hence it is a section of 
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T*(R") of the form 
n=d(y, + A(y)) 


where A(y) is a quadratic form. Now take x,=y,+A(y), X,=y2.-.-,X, =), 
as new local coordinates. The tangent plane of A at y, is then transversal to 
the graph of dx, which passes through y,). The transversality means in 
standard local coordinates that the map 


Aa(x, ¢)r+¢ 


has bijective differential at yy. Hence we can locally take € as a parameter 
on A which is then defined by x=@(&) where ¢ is a C™ function from a 
neighborhood of ¢, to IR”. Since ¢ is homogeneous of degree 0, we can take 
a homogeneous extension to a conic neighborhood still defining A. Now A 
is a conic Lagrangian so we have o=yeé jax;=0 on A, thus 


0=) 6 ,db(H=dHO)-Y Olde, A(O=)VE,6,(8). 
This means that $,(€)=0H(¢)/6¢, and completes the proof. 


It is often but not always possible to use a smaller number of @ variables 
when parametrizing a Lagrangian: 


Theorem 21.2.17. Let the hypotheses be as in Theorem 21.2.16 and let k be the 
dimension of T,,(A)\TZ(X). Then A can be parametrized near y, by a non- 
degenerate phase function in X x(IR‘\0) but not by one involving fewer 
parameters. 


Proof. Assume first that A is parametrized by the non-degenerate phase 
function $(x,0) where 0eR*. Then p=T,, (A) TX(X) is the image of the 
set in T,,(X) x R* defined by d¢,=0, dx =0, so it is at most of dimension x. 
Hence x2 k. 

To prove the first statement we choose local coordinates in X at xg such 
that y,=(0,2,) and A is defined by x=H'(¢) nearby. By a linear change of 
variables preserving x, and therefore y, we can make yw defined by dx=0, 
dé" =0 where €”=(€,,,,-..,€,). Set €’=(€,,...,€,). The Lagrangian plane 
T,,(A) contains so it is contained in the orthogonal plane y’ defined by 
dx’=0. Hence the map 


Aa(x, €)ro(x”, é')elR” 
has bijective differential at y,. In fact, dx’=dx”=0 in the kernel so it is 


contained in yu and satisfies dé’=0. The statement is now a consequence of 
the following : 


Theorem 21.2.18. Let AcT*(R")\0 be a conic Lagrangian manifold in a 
neighborhood of a point y,. where 


Aa(x, C)ro(x”, C)elR” 
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is a bijection for a splitting x=(x',x”), €=(€,¢") of the coordinates. Then 
there is a function S(x”, €') such that near 7 


A= {(0S/6E',x"; &, —08/0x’)}. 
A is defined by the phase function $(x, &')=<x’', > —S(x", €). 


Proof. On A we can regard x’ and é” as functions of (x”, €’), and 
O=<C,dx> =d(Kx', >) — Cx, do) + C0", dx". 


If we take S(x”,é)=<x',é> it follows that 6S/dx” =—€”" and that dS/dé' 
=x’. The last statement is obvious since 0¢/0¢'=0 means that x'=0S/dé’, 
and 6¢/0x =(é', — 6S/éx”). 


The reader may by now have raised - and answered - the question why 
we bother to introduce the notion of clean phase function when there is 
always a non-degenerate one. The answer is that the composition studied in 
Theorem 21.2.14 leads naturally to a clean but degenerate phase function 
when the intersection taken there is just clean and not transversal. However, 
before elaborating this point we must make an important remark on the 
definition of a canonical relation G in the product (T*(X)~\0) x (T*(Y) 0) 
by means of a phase function. Recall that by. definition G is Lagrangian 
with respect to the difference ¢,—oy of the symplectic forms of T*(X) and 
T*(Y) lifted to T*(X)x T*(Y)=T*(X x Y). This differs in sign from the 
symplectic form oy+oy of T*(XxY), so it is the twisted canonical 
relation 


G'= {(x, é, yy —n)(x, g, Ys neG} 


which is Lagrangian with respect to the standard symplectic structure in 
T*(X x Y). (Recall that this sign change occurred already in the calculus of 
wave front sets in Section 8.2, particularly in Theorem 8.2.14.) If we take a 
clean phase function ¢ defining G’ it follows that it defines G through 


(21.2.9) G={(x, b.(x, y, 8), y, — by (x, y, 9); Pol, y, 8) = 0}. 


When we say that a canonical relation is defined by a phase function we 
shall always refer to the formula (21.2.9) with a sign change in the second 
fiber variable. 


Proposition 21.2.19. Let X, ¥, Z be C® manifolds and G,, G, homogeneous canoni- 
cal relations <(T*(X)~.0) x(T*(Y)\0) and (T*(Y)~0) x (T*(Z)\0), parame- 
trized locally by non-degenerate phase functions (x, y,9), OER’, and Wy, z,7), 
teER", defined in conic neighborhoods of (X9,V9,9o) and (Yo,Z0,To) Where dy 
=0, W=0, 6+ ,=0. If the composition G,oG, is clean at the correspond- 
ing point then 


P(x, z, y, 8, 1) = G(x, y, 8) + W(y, Z, 7) 
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is a clean phase function defining the composition, if (y,0,t) are now regarded 
as the parameters. The excess of © is equal to the excess in the clean 
intersection of G, x G, and T*(X) x A(T*(Y)) x T*(Z). 


Note that (y,@,t) belongs to a conic manifold so there is an obvious 
interpretation of the homogeneity condition on phase functions in this situa- 
tion. We might also take 


(y(\\? +| 717), 0, z)eIR"” +4 


as parameter to agree literally with Definition 21.2.15. Here ny =dim Y. 


Proof. That @ and w are non-degenerate means that (locally) 
M ={(x, y,9);5=0}, N={(0,z,1);4,=0} 

are manifolds, with tangent planes defined by d@,=0 resp. dy, =0 and that 
the maps 

Ma(x, ys A)H(x, , yy — PEG, 

Naty, Z, T)e>(y, 2, —WijeG, 
are local diffeomorphisms. That G,oG, is clean means first of all that 

G={(%, Oyen Vn" z OeG, xG,,¥=yn=n"} 


is a manifold, of dimension ny+ny+ny+nz,—2ny+e=nyt+n,+e where e is 
the excess of the intersection. Taking (x, y’, 0, y’, z,t)—¢M x N as a parametri- 
zation of G, x G, we know that G is defined by the equations 


$=, (x, y, 6), y=y’, f= — (x, y, 6), n=", 
n’=0V'207) = —wz(y"s2,7) 
and that the tangent plane of G is defined by the vanishing of all differen- 
tials in addition to those of ), wi, defining the tangent plane of Mx N. All 


this just expresses the hypothesis that the intersection is clean. The claim is 
that 


{(x, z, y, 9,1); Py (x, y, 8)=0, Wy, z, 7) =0, h(x, y, + Wy (y, 2, 1) =O} 
is a manifold with tangent plane defined by the equations 
dg,=0, dy,=0, d(g,+y,)=0. 


Putting y'=y’=y and defining n'= — ¢,(x, y’, 8), n” =,(y", z, 1) identifies the 
sets and the tangent planes. Since the excess of the phase function is by 
definition dim G ~dim(X x Z), the proposition is proved. 


In Section21.6 we shall return to the composition and in particular 
discuss operations defined on half densities in the canonical relations. As in 
Section 18.2 these will occur as symbols for the distributions associated with 
a Lagrangian in Chapter XXV. 
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In questions concerning regularity and existence of solutions of a pseudo- 
differential equation 
Pu=f 


the theory of pseudo-differential operators developed in Section 18.1 allows 
one to replace the principal symbol p of P by the product with any non- 
vanishing homogeneous function q. In fact, gq can be taken as the symbol of 
an elliptic and therefore essentially invertible operator; the equation Pu=f 
is then nearly equivalent to QPu=Qf. The microlocal point of view where 
one first looks only for equations modulo functions with no wave front set 
at a given point (x9,& ) in the cotangent bundle allows one to localize the 
argument so that only invertibility of g at a point is needed. In particular, 
the points where p is non-characteristic are then of no further interest. 

In Chapter XXV we shall extend the theory of pseudo-differential oper- 
ators with a machinery which also allows one to change the principal 
symbol by composition with a local homogeneous canonical transformation. 
To benefit from this one must know simple normal forms, which can be 
studied quite explicitly, such that general principal symbols can be reduced 
to one of them by the two basic operations 

(i) multiplication by a factor +0; 

(ii) composition with a local homogeneous canonical transformation. 
This section is devoted to the geometrical investigation of normal forms. 
The material will not be used until Chapters XXVI, XXVII so the reader 
may well postpone studying it without loss of continuity. 

Let p be a C® complex valued function homogeneous of degree m in a 
conic neighborhood of (0,¢,)eT*(R")\0. By multiplication with the non- 
vanishing function |é|*~" we can always change the degree to any con- 
venient value s; it often turns out that s=1 is a good choice. We first 
observe an immediate consequence of Theorem 21.1.9 where this is so. 


Theorem 21.3.1. Let p be a real valued C® function in a conic neighborhood 
of (0,€ )ET*(R")~ 0, with p(0,E,)=0, such that p is homogeneous of degree 1 
and the Hamilton field H,, is not in the radial direction at (0,¢,). Then there is 
a homogeneous canonical transformation y from a conic neighborhood of 
(0, ,), &,=(0,0,...,1) to (0,E5) such that y* p=€,. 


Proof. By hypothesis the conditions in Theorem 21.1.9 are satisfied by p, =p, 
a=0, b=s,, so we can complete to a homogeneous system of symplectic 
coordinates q,, p, near (0,¢,) with q;=0, p,=6,, at (0, €o). 


When p is complex valued our equivalence problem becomes much more 
difficult but we shall discuss a number of important cases. In doing so we 
write p=p,+ip, where p, and p, are real valued. If dp, and dp, are 
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linearly independent at a point, then the equation p=0 defines a manifold 
of codimension 2 in a neighborhood. We shall first study the case where it 
is involutive. 


Theorem 21.3.2. Assume that p is a C® homogeneous function in a conic 
neighborhood of a zero (0,€))eT*UR")~ 0 and that 


(21.3.1) ReH,, ImH, and the radial direction p are linearly independent at 
(0, fo); 
(21.3.2) {p,p}=—2i{Rep, Imp} =0 in a neighborhood of (0,€,) when p=0. 


Then there is a C® homogeneous function a(x,€)+0 and a homogeneous 
canonical transformation x from a conic neighborhood of (0,¢,) to (0,€ ) such 
that x*(ap)=E, +iey. 


Proof. The hypothesis (21.3.1) implies that V=p~+(0) near (0,é,) is a C® 
conic manifold of codimension 2 such that the canonical one form does not 
vanish in the tangent plane at (0, €,). Condition (21.3.2) shows that we have 
o(ReH,,ImH,)=0 on V that is, V is involutive. By Theorem 21.2.4 we can 
therefore by a homogeneous canonical transformation bring V and €, to the 
desired position, so we assume from now on that €,=«, and that V is 
defined by £,=¢,=0. We may also assume that p is of degree 1. Now 


H,=a(x, €)0/0x,+b(x,¢)d/Ox, on V 


where a=dp/d€, and b=6ép/dé, span the complex plane. This is an elliptic 
operator in x,, x, - essentially the Cauchy-Riemann operator - depending 
smoothly on the parameters (x,,é3,...,¢,), and a, b are homogeneous of 
degree 0. (Outside V we might have a very complicated equation but it will 
be avoided.) By Theorem 13.3.3 we can therefore find local C™ solutions of 
the equation H,u=/f on V if feC™; if f is homogeneous of degree u we can 
take u homogeneous of degree yp. We shall now show that it is possible to 
make {p,p} equal to 0 identically by multiplying p with a non-vanishing 
function. This requires several steps. 

a) By hypothesis we know that {p,p}=0 on V, so by Taylor’s formula 
(Theorem 1.1.9) we have near (0, ¢,) 


{P,P} =foP+fiP 
with fjeC”. Now set g=e” p where w is homogeneous of degree 0. Then 
(21.3.3) {4, G3 =e7**"({p, B} + {p, w} D+ P{W, D} + {W, w} pp) 
is O(p”) locally at (0,€,) if on V 
Sot {p,w}=0, fi, +{W,p}=0. 


Since {p, 3} ={P,p}= ~ {pp} we can always replace fy by (fo—F,)/2 and f, 
by (/,—fp)/2, hence arrange that f,=—/f,. The two equations are then 
identical so choosing w with H, w= — fy on V, we obtain {q, 7} =O(p”). 
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b) Assume that {p,p}=O(p*) at (0, £,) for some k>1. As in step a) 
k 
{p.P}=S ie, f=. 
0 


Choose now qg=e” p where 
k—1 


w= p? wipip ds 
0 


Then the last term in (21.3.3) is O(p?*-!)=O(p**!) since we can only loose 
two of our 2k factors p, p and a Poisson bracket {p, p} =O(p*) appears at the 
same time. In the preceding two terms the contributions where a Poisson 
bracket of p and p occurs are O(p?*-')=O(p**1). Hence (21.3.3) is O(p*t!) if 


f,+H,w,;-H,w,_;=0 on V. 


Since f,_j=— f the equation with j replaced by k—j is equivalent so it 
suffices to solve these equations when j<k/2. We can then take w,=0 when 
j>k/2 and have the equations 


H,wj=—fj, j<k/2, H,w;=—f/2__ if j=k/2, 


for f; is purely imaginary if j=k/2 is an integer. Now f, can be chosen 
homogeneous of degree 1—k, and extending a solution of the preceding 
equation by homogeneity from the case |¢|=1 we obtain w, homogeneous of 
the same degree, hence w homogeneous of degree 0. 

c) By repeating step b) we obtain w/ vanishing of order j on V and 
homogeneous of degree 0 such that {q,,q,} vanishes of order k+2 in a fixed 
neighborhood of (0,¢,) on V if 


q,=pexp(w° +... + w*). 
If we choose w with the Taylor expansion w° + w! +... at V and homogeneous 


of degree 0, the problem is solved to infinite order. 
d) We have now reduced the proof to the case where 


{P1,P2} =C1 Py +Cr Po 


with c; homogeneous of degree 0 and vanishing of infinite order on V. 
Choose f, homogeneous of degree 0 so that 


{el'p,,P2}=e%'c,p2, that is, Ay, f,=Cy 


and so that f,=0 when x,=0, if @p,/e€,+0 as we may assume. The 
solution of this first order differential equation is unique, hence homo- 
geneous of degree 0, and it vanishes of infinite order on V since H,, is 
tangential to V; we just have to differentiate the equation to prove this 
successively. Next we observe that 


fe! p,,e%p,}=0 if {e"p, fr} tele, =0 
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which again has a solution f, vanishing of infinite order on V. Thus 


a=(elp, tie! p,)/(py +ip,)=1+((e%—1)p, +ile!? —1) p2)p; + ipa) 


is infinitely differentiable, and differs from 1 by a function vanishing of 
infinite order on V. Now q=ap has the desired property {Regq, Imq}=0. In 
view of Theorem 21.1.9 we can make Req=€,, Imq=€, in a new homo- 
geneous symplectic coordinate system, which completes the proof. 


Remark. Note that after the reduction the Hamilton field is exactly the 
Cauchy-Riemann operator in x,+ix,. This simplicity is essential for it 
allows us to connect with classical function theory and also gives an 
operator which we know how to handle even outside V. 


We shall now study the opposite situation where {p,p}+0 at a zero y of 
p. This implies that ReH,, ImH, and the radial vector field p are linearly 
independent at y, for a linear relation 


aReH,+bImH,+cp=0 


implies a{ReH,, ImH,}=0 when applied to ImH, and b{ImH,,ReH,} 
=0 when applied to ReH,, in view of the homogeneity. In particular, 
p~'(0) is near y a manifold of codimension 2. Note that when p=0 we have 


{ap, ap} =|a|? {p, P} 


so the sign of the Poisson bracket {Rep,Imp} cannot be changed by the 
operations we allow. The following theorem shows that it is the only 
invariant: 


Theorem 21.3.3. Let p be C® and homogeneous in a conic neighborhood of 
(0,€5) where p=0, {Rep, Imp} 20. Then there is a homogeneous canonical 
transformation x of a conic neighborhood of (0, té,) on a conic neighborhood 
of (0,&)) and a C™ function a with a(0,€))+0 such that 


x*(ap)=€,+ix, ¢,. 


For the proof we need a lemma which we state in sufficient generality 
for another application later on. 


Lemma 21.3.4. Let p and q be real valued C® functions near a point y in a 
symplectic manifold S where p=q=0 and {p,q}>0; and let a, b be positive 
numbers with a+b=1. Then there is a unique positive ueC® in a neigh- 
borhood of y such that 


(21.3.4) {pug} =1. 


If S is conic and p, q are homogeneous of degree m and m’ respectively, then 
u’p and u’q are homogeneous of degree a(l—m’)+bm and am’ +b(1—m). 
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Proof. The equation (21.3.4) can be written in the form 
(21.3.4)' u{p,q} +(bqH,—apH,)u=1. 


On the manifold V of codimension 2 where p=q=0 it reduces to u{p,q}=1. 
If we set uy =1/{p,q}, Pop =P US, Go = ue, we have therefore {py,q)}=1 on V. 
We simplify notation by assuming that p and q already have this property. 
By the Darboux theorem (Theorem 21.1.6) we can choose symplectic coor- 
dinates so that €,=p, y=(0,0). Thus @q/dx, +0 at (0,0) so the equation q 
=0 can be solved for x, to give x, =X(x’, 6), x’=(X,,...,x,). Since we have 
{€,,x,—X(x’,6)} =1 another application of Theorem 21.1.6 shows that we 
can take x, —X(x’,¢) as a new symplectic coordinate instead of x,. With 
such coordinates we have p=¢, and q=Qx,, QEC™®. The equation (21.3.4) 
is then of the form 


udq/0x,+(b 6qg/dx, x, 6/0x,+a€, g/éx, 6/0€,)ut+...=1 


where dots indicate a differential operator vanishing of second order when 
x,=€,=0. Division by @q/éx, reduces the equation to the form discussed 
in Theorem C.2.1 in the appendix and proves the first part of the lemma. 

Assume now that we have homogeneous p and g. Choose u so that 
(21.3.4) is valid in a neighborhood of y and set P=u* p, Q=u?g. Then {P,Q} 
=1. Set M* P=P, M¥ QO=Q, and M*u=u, where M, is multiplication by t 
in S. Then 

te Q;} =t 
by (21.1.6) and since P=u‘t™p, Q,=u?t™ q we obtain 
{uf p,upgy=ee-—™, 

In view of the uniqueness of the solution of (21.3.4) it follows that 


re aimed 


in a fixed neighborhood of y when ft is near 1. Hence u can be extended to a 
homogeneous function of degree i —m—m’, and (21.3.4) remains valid in a 
conic set. This proves that P and Q have the stated homogeneity properties. 


Proof of Theorem 21.3.3. Assume that {Rep,Imp}>0. If we apply Lem- 
ma 21.3.4 to Rep, Imp with a=b=4 and y=(0,£,) we obtain a function u 
such that u? p=p is homogeneous of degree $ and {Rep,Imp}=1. Now we 
choose a real valued homogeneous function h of degree 1 such that 


{Re p,h} = {Imp,h} =0 


and h(y)=1. To do so we just solve these differential equations with homo- 
geneous C® values prescribed on V=p~‘+(0), equal to 1 at y. This is possible 
since [Hp.5,Hims]=0 by the Jacobi identity, which maked the Frobenius 
theorem applicable (see CorollaryC.1.2 in the appendix). Note that the 


plane spanned by Hp,, and H;j,,, is transversal to V. The uniqueness of the 
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solution proves its homogeneity. Now Theorem 21.1.9 shows that we can 
find homogeneous symplectic coordinates near (0,¢,) such that 


y,=h* Rep, y,=h-*Imp, 7,=h, 
and so that y,;=...=y,=7,=...=,_,=0 at y. Then 


h? p=n,+iyin, 
which proves the theorem when {Rep,Imp}>0. For the opposite sign we 
apply this result to p and conclude that there are symplectic coordinates 
such that (0,€,) corresponds to (0,¢,) and the product of p by a non- 
vanishing function is equal to 4, —iy,y, Replacing the coordinates y,,7, by 
—y,,—, then proves the statement. 


The zero set of p may be a symplectic manifold of codimension 2 even if 
{Re p, Im p} is identically 0 in it. An example is 


P=P,+ip,=E,+ixte, k>1. 


Then {p,,P2}=kxi-* Gas H,, =0/0x,, H,,= —kxi-} €nO/06, +xf 0/0x,, sO 
all Poisson brackets formed with at most k factors vanish when €,=x,=0 
but Hy, P2 = {p1, {Pi,---, {pisp2}.--}} =kté, # 0 if &, # 0. This situation is 
characterized invariantly in the following theorem. 


Theorem 21.3.5. Let p=p,+ip, be a homogeneous C® function in a conic 
neighborhood of y=(0,o) where p=0, H,, +0, and assume that p, has a zero 
of fixed order k>1 near y on each integral curve of H,, in p; ‘(O) (that is, the 
leaves of the Hamilton foliation of the involutive surface pz; *(0)). Then the 
zeros of p form a manifold V near y. On V we have H,,=bH,,, all Poisson 
brackets of p, and p, with at most k factors vanish, and for every aeC® 


(21.3.5) Hkeap lm ap=la|*(Re(a+iba)*~' Hk, Imp. 


There is a homogeneous canonical transformation x from a conic neighborhood 
of (0,¢,) to a conic neighborhood of (0,& ) and a C® function a with a(0, &) 
+0 such that 

x*(ap)=E, +ixy , 


unless the sign on the right-hand side of (21.3.5) is always <0, that is, k is odd 
and Hy, Imp <0; then Imp changes sign from + to — along the integral 
curves of H,, and the statement is true with ¢, replaced by —é,. 


Before the proof we give a general reduction which follows from the 
Malgrange preparation theorem. It does not really provide a normal form 
but it will be of fundamental importance in Chapters XXVI and XXVII. 


Theorem 21.3.6. Let p=p,+ip, be a C” homogeneous function in a conic 
neighborhood of (0,f5) where p=0 and H, is linearly independent of the 
radial vector. Then there is a homogeneous symplectic transformation y of a 
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conic neighborhood of (0,€,) on a conic neighborhood of (0,f,) and a C™ 
homogeneous function a with a(0,€,)+0 such that 


x (ap)=E, tif (%E) 
where €’=(&,,...,€,). Thus the imaginary part is independent of €,. 


Proof. We may assume that p is of degree 1 and that Hg,, does not have 
the radial direction. By Theorem 21.1.9 homogeneous symplectic coordinates 
can then be chosen so that p,=¢, and €)=s,. In particular ép/d¢, +0 so by 
Malgrange’s preparation theorem (Theorem 7.5.6) we can find gq, reC™ near 
(0, ¢,) so that 

Gi=qptr 


and r is independent of ¢,. Restricting to €,=1 and extending by homo- 
geneity from there we can make q and r homogeneous of degree 0 and 1 
respectively. Thus writing r=r,+ir, we have 


qp=C,—",—ir, 


and we can apply Theorem 21.1.9 again to choose new homogeneous sym- 
plectic coordinates with y,=x, and n,=¢,—r,. Then H, =H,,= —0/0€, so 
H,,r,=0. In the new coordinates we therefore have —r,=f(y,n') so qp=n, 
+if(y,7’) with f independent of 74,. Hence a=q and x: (y,)+>(x, 4) have 
the required properties. 


Proof of Theorem 21.3.5. In the hypersurface V, defined by p, =0 the zeros 
of q=H*~' p, form a hypersurface V, where {p,,q}+0 and Hi p,=0, j<k. 
Thus the restriction of p, to V, is divisible by q*, so 


P,=bp, t+cq*, 
{P1,P2}=P1{p,,5}+ {p,,c} q* +ck {p,q} 4 


k-1 


where b, ceC®. Thus the zero set V of p is locally equal to V, and H,, 
=bH,, there. All Poisson brackets with at most k factors p,, p, vanish on V. 
In fact, taking a Poisson bracket of {p,,p,} with cq“ gives a zero of order k 
—1 which cannot be removed by k—2 subsequent Poisson brackets, and the 
Poisson bracket of functions divisible by p, is again divisible by p,. Hence 
it suffices to prove (21.3.5) when a is a constant. Then we have 


{Reap, Im ap} =i{ap, ap}/2=|a|” {Re p, Imp}, 
and Hy. 7, may be replaced by (Re(a+iab))*~* (Hg. ,)*'. 
Now choose a and new symplectic coordinates such that ap has the 
form 
q=S,+if (x, 0) 


and y=(0,¢,) (Theorem 21.3.6). Since Hp,,=0/0x,+0, it follows from the 
part of the theorem already proved that the zeros of g near (0,¢,) form a 
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manifold of codimension 2 where 
E€,=0, Oa f(x,&')/0xi=0 when j<k and d* f(x, &)/dx* +0. 


The equation 6*~' f (x, &')/dx4-! =0 defines x, as a C® function X(x’, 2’) and 
a f(x, é)/dxi must vanish when x,=X(x’,é) since the zero set of q is a 
manifold of codimension 2. Thus the quotient f(x, &\(x,—X(«’,f))* is 
smooth and positive, if 0 f(x, €’)/Oxf>0 at (0,¢,) as we now assume. Hence 
it has a positive k' root, so f(x, @)=g(x,@)* where g is C”® and homo- 
geneous of degree 1/k, and {¢,,g(x,¢)}>0 at (0,¢,). We can now apply 
Lemma 21.3.4 with p replaced by €, and q replaced by g, with a=k/(k+1) 
and b=1/(k+1). This gives a function u such that 


(Wat it awOrDg,  paulldrng 


Here ¢ is homogeneous of degree k/(k+1) and w is homogeneous of degree 
1/((k+1). As in the proof of Theorem 21.3.3 we can now choose h homo- 
geneous of degree 1 with h(0,¢,)=1 so that 


{, h} = {, h} =0. 


By Theorem 21.1.9 there are homogeneous symplectic coordinates (y,7) at 
(0,¢,) with y=0, 7 =¢, at (0,¢,) and 


my ah, yy=hO DY, on 


I 
= 


Then 
yk + 1) piles ME, +if~=n, +i yt Nn 


so the theorem is proved when 0*f/dxi>0. If the opposite sign holds we 
can apply this conclusion to p, hence reduce p to the form n,—iy*y, at 
(0,¢,). If k is even we multiply by —1 and take —y,, —y, as new coor- 
dinates instead of y;,7 to obtain the desired form. If k is odd we replace y,, 
n, DY —Y,_, —N, and obtain the equivalence with 4, +iy{n, in a neigh- 
borhood of (0, —¢,). The sign of (21.3.5) is then invariant so it is not possible 
to have equivalence with 7, +iy{n, at (0,¢,). 


21.4. Folds and Glancing Hypersurfaces 


In the preceding sections we have studied a number of symplectic classifi- 
cation problems in non-degenerate situations. We shall now consider some 
of the simplest singular cases, starting with Lagrangian submanifolds A of a 
cotangent bundle T*(X). If A is a section, that is, the projection 2: A X is 
a diffeomorphism, we know that A is locally the graph of the differential of 
a function in X; indeed, it was this observation which first brought us in 
contact with Lagrangian manifolds in Section 6.4. We shall now examine the 
case where z is a folding map. (For definitions and basic facts see Appendix 
C4) 
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Theorem 21.4.1. Let X be a C® manifold and A a Lagrangian submanifold of 
T*(X) such that the projection AX has a fold at (X9,€)e€A. Then there 
exist local coordinates x in X, vanishing at x), and a C® function W(x) near 0 
such that at (xp),&,) the manifold A is parametrized by 


(21.4.1) A= {(x,0/Ox', Ob/Oxy = x3); x, 2 Of. 
Here x'=(x,,..-;X__1): 


That (21.4.1) defines a manifold with a folding projection is clear if we 
put x,=2? for then the set is of the form 


{(’, 12,0, W(t, t), ...,8, We, t2) +1,)}. 


When t, +0 we have the graph of the differentials of the two functions w(x) 
+2? so it is clear that A is Lagrangian. 


Proof of Theorem 21.4.1. By Theorem C.4.2 we can choose local coordinates 
t,,.--,t, in A and x,,...,x, in X so that the projection of the point in A 
with coordinates (t’,t,) has the coordinates (¢’,t?) in X. Then A is defined by 


A={(t',t;, E(O)} 


where €(t) is a C®-function of t with values in IR” and we are using 
standard coordinates in the cotangent bundle. That A is Lagrangian means 
that 


n-1 


Y, €;dt;+€,,d(t?) 


1 


is closed, hence the differential of a function ¢(t) with 
€;=0¢/0t;, j<n; 2t,¢,=06/0t,. 


Hence A is the graph of the differentials of the functions $(x’, +x?) when 
x,>0. To put it in the desired form we write ¢=¢,+6, where ¢,(t) 
=(P(t',t,)+ o(t', —t,))/2 is even under the involution in A and @, is odd. 
Thus we know by TheoremC.4.4 that ¢$,(t)=w(t’,t?) for some weC® at 0. 
Since t,,...,f, are parameters in A we know that 0¢,/0t,+0 at 0 for some j. 
Now 0€,/dt,=07? $/dt,0t,=0 at 0 if j<n, so we must have 0¢,/dt,+0 at 0, 
that is, 
0° o,(t)/Otz = 0° $(0)/Ot; +0. 

Changing the sign of t, if necessary we may assume that the sign is positive. 
Since 6/ ¢,/dti=0 when t,=0 if j<3 we can now write ¢,(t)=t? F(t) where 
FeC®, F(0)>0, and F is an even function of t,. Hence Theorem C.4.4 shows 
that there is a C® function G near 0 such that 


3 4,(t)? =G FO) tf =GUe, tp). 


We have G=0 and 0G/dx,>0 when x,=0, and ¢,(x’, +x,7)= +4G(x)}, 
x, >0. If we take G(x) as a new variable instead of x,, the theorem is 
proved. 
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Remark. Ww and the coordinate function t, are uniquely determined where 
t,>0 but otherwise arbitrary apart from the smoothness. The other vari- 
ables are any variables which define a coordinate system. Another form of 
the theorem is therefore that there exist two C® functions @ and w at xy 
with $(x,.)=0, $’(x9)+0, such that the projection of A is locally defined by 
@20 and A is the graph of the differentials of the functions y+3¢*. 
Lagrangians of the form just considered occur frequently in diffraction 
theory. 


Next we shall study folds over a symplectic manifold. This will lead to 
the degenerate symplectic structure discussed already in Section 21.1. 


Theorem 21.4.2. Let S be a C® symplectic manifold, T a C® manifold of the 
same dimension 2n and f: T>S a C® map with a fold at t,. Then there exist 
a local symplectic coordinate system (x,&) vanishing at f(t)) in S, and local 
coordinates (y,n) vanishing at ty in T such that 


(21.4.2) SOM=Ops 0 Va 7/2, Nass Mn) 
Thus the pullback f* o of the symplectic form in S is given by 


(21.4.3) ny dn, Ady,+>.dn,ady,. 
2 


Conversely, let y, y be local coordinates vanishing at ty which are even with 
respect to the involution of the fold apart from n, which is odd. If f*o is 
equal to (21.4.3) then there exist local symplectic coordinates x, € at f(t 9) such 
that (21.4.2) holds. 


Proof. By TheoremC.4.2 we can choose local coordinates t,,...,t,, in T 
vanishing at ty and s,,...,S,, in S vanishing at f(t)) such that 


S(Q=(y, ston 1 03,/2)- 


By the Darboux Theorem 21.1.6 we can choose local symplectic coordinates 
x, € in S vanishing at f(t)) such that €,=s,,. Then 


Yi=h* x; j21; ny=t* oj, jel, 11 =Clons 


are local coordinates at t,. In fact, if the differentials of y,, 1, at to in the 
direction y vanish for all j, then <y,dt,,>=0 and 


¢f' (to), dxpacf (to) 7,46,» =9, J=l,...,n, 


since < f'(to) y, dé,> =<, dt3,,/2> =0; hence y=0. Since n7/2=f*€, we have 
(21.4.2), which proves the first part of the theorem. 

To prove the second part we choose, using Theorem C.4.4, C® functions 
4, Py, in a neighborhood of f(t)) such that f*q,;=y, and f*p,=n, if k+1, 
f* p,=n3/2. From (21.4.3) it follows that o= ¥dp, A dq, in the range of f; in 
Sue we have local coordinates at f(t)). They are symplectic when 
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p,>O0 but fairly arbitrary when p,<0. Now define ¢;=p;, xj=q, in a 
neighborhood of 0 when p,>0, and extend the definition to a full neigh- 
borhood of 0 so that 

H,,x;=0, Hy,o;=—44;. 


These equations are valid when p,>0, and since H, = —0/0€, when p,=0 
they define x, € uniquely in a neighborhood of 0. For the new coordinates 
the commutation relations are also valid when &, <0, for 


Ay {xX ={41 {x;,%}} = — {xy {91 x;}} + {x;, {41% 33 =0 


by the Jacobi identity and similarly for the other Poisson brackets. With 
these symplectic coordinates (21.4.2) remains valid, and the proof is com- 
plete. 


The pullback o,=f*o of o to T is of course a closed two form. If i is 
the involution of the fold we have i*a,;=0,, because foi=f. This is also 
obvious from (21.4.3) since i just changes the sign of 7,. The following 
version of Theorem 21.1.7 for manifolds with involution gives a characteri- 
zation of this situation. 


Theorem 21.4.3. Assume that S and a satisfy the hypotheses of Theorem 21.1.7 
and that i is a C® involution with i*o=o and S, as fixed point set. Then 
there exist local coordinates y, 4 such that o@ is equal to (21.4.3) and the 
involution only changes the sign of the y, coordinate. 


Proof. We shall give a proof which is completely independent of Theo- 
rem 21.1.7 since the existence of the involution actually simplifies matters. 
Let s,,...,5,, be local coordinates at a point s,éS, such that the involution 
just changes the sign of s,,. Write 


a=) a,,ds;,Ads,. 


i<j 


: pint OF stan, aig Serta 
Since i*o=0 we have i*a,,=a,, if j+#2n, and i*a,,=—a,,; 


Theorem C.4.4 it follows that for suitable A;,eC” 
o= (ft A)afadf, 


where f(s)=(5,,---,52,_1553,/2), for this means only that f* A,,=a,,; when 
j<2n and that s,,f* A,;=4,; when j=2n. The form 


ij 
G=)1A,(t)dt, dt, 


is closed when t,,20 since the pullback by f is closed, and it is non- 
degenerate at 0. By the Poincaré lemma 


&=d(¥B,(t)dt) 


when t,,,20 and |t|<60 say, and B,eC® then. Extending B, to a C® function 
in {teR?"; {t|<6} we find that the definition of A, ; can be chosen so that ¢ 


if j=2n. By 
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is a non-degenerate closed two form in a full neighborhood of 0. Since ¢ 
= f* & Theorem 21.4.3 is now an immediate consequence of Theorem 21.4.2. 


Theorem 21.4.4. Assume that S and o satisfy the hypotheses of Theorem21.1.7 
and that f, g are two involutions with f*o=o, g*o=o having S, as fixed 
point set and with linearly independent reflection bundles. Then there exist 
local coordinates x, € such that 


(21.4.4) o=E, dé, ndx, +d, ndx, 
2 
(21.4.5) I@QO=,-C.0); 80,0 =O44+655%5—E15€). 


Proof. Using Theorem 21.4.3 we choose coordinates x, € such that o and f 
have the desired form, and using Theorem C.4.6 in the appendix we choose 
another set of local coordinates y, 4, vanishing at the same point, such that 
f and g have the desired form. We have to reconcile the two coordinate 
systems. Note that in the x€ system the vector field 6/éx, when €,=0 is 
determined as the radical of the symplectic form while in the yy system the 
vector field 0/Oy, when 7, =0 is uniquely given by the span of the reflection 
bundles. A first step in the proof is therefore to show that these vector fields 
do agree. To do so we shail prove that 


(21.4.6) niin Ys=M {n1.nj 
=0, j>1, when y,=0, that is, €,=0. 


(Note that 7, H,, is a smooth vector field.) To prove this we set u=y, or n;, 
for some j>1, and observe that f*u=u, g*u=u but that —y,=f*g* y, 
—y,. This means that when 7, +0 


—{,,us={f* 9% y,.f* g* u} —{y, uh =f 9* {yu} —{y,, 4}. 
Hence 
—ny ing uy=f* 9* (0; (1,43) —m, (1,43 =0 when 74,=0, 
which proves (21.4.6). Now 
nH, =1,/€ ,(0n,/06, 0/0x, —On,/0x, 0/0§,)+... 


where the dots indicate a smooth vector field vanishing when ¢,=0. We 
have 47,=0 when €,=0 so én,/éx,=0, 0n,/0€, +0 and 


n, H,, =(6n,/0€,) 6/éx, when ¢€,=0. 
Hence (21.4.6) gives 
(21.4.7) dy,/6x,=On,/Ox,=0, j=2,...,0; 
6n,/Ox,=0 when €,=0; 


proving the statement about agreement of 0/0x, and 6/@y, made above. 
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We can express the xé coordinates in terms of the yy coordinates, 


x;=o,(y.nin), all j; 
Ex=nyWienien), €=W(ynin), j>t. 


Here we have used that all coordinates except ¢, and 7, are even under f 
and that these are odd. Differentiation with respect to x, gives when €, =0, 
if we use (21.4.7) and note that it implies 6y,/dx, +0, 


0¢,/6y,=0, Oy,/éy,=0; jl, €,=0. 
Thus 


X,= (0, y',0, 1’); E,=W,0,y',0.0, j+1, 


are equal to x, resp. ¢; when ¢,=0 and are even under f and g. Set X,=x,, 
which is an even function with respect to f. 

If veC”, f* v=v and dv/éx, =0 when €,=0, then H, is a smooth vector 
field equal to 


(21.4.8) 8?v/0E2d/dx, +9 (d0/08 j/8x,—dv/dx,,0/0E,) 
2 


when €, =0. In fact, when €, +0 we have 
1 
o1 


Since v and 0v/dx, are even functions of €, we have dv/dx,=O(é?) and 
Av/aE, =O(E,), hence dv/0é, =, 670/027 + O(E?). 

We can now complete the proof by following that of the Darboux 
Theorem 21.1.6 rather closely. To be able to use (21.4.8) we shall choose 
symplectic coordinates q, p with the required properties in the order p,, 
Ga.+++> Pao Inv Pis U1. S€t Py =. Then p,»=¢, when €,=0, and f*p,=g*p, 
=p,. Now we determine q, by solving the differential equation 


H,=—(00/0é, 0/dx, —dv/dx, 9/0€,) +5 (dv/de ;0/0x,—dv/0x, 0/0E). 
2 


{P2423} =H,,92=1; q.=0 when X,=0. 


The solution is unique near 0 since dX,=dx, there. Now the Poisson 
bracket condition and the boundary condition are both invariant under f 
and under g, so f*q, and g*q, are also solutions, hence f*q,=g*q,=q). 
When €,=0 the equation is satisfied by g, =x, since (21.4.8) shows that 


H,, =0/0x,+ 0? p,/0€4 4/0x, 


then, so g,=x, when €,=0. The equation (21.4.8) is therefore applicable to 
qz, and we can now choose p,; with 


H,,P3=H,,P3=0, p3;=&3; when p,=q,=0. 


Continuing in this way we obtain p,,q),...,P,.4, Satisfying the commu- 
tation relations, all invariant under f and g, so that these functions are 
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equal to €,,x,,...,€,,x, when €,=0. Now we determine p, from the differ- 
ential equations 


H,,P,;=H,,P,=0, j>1;  py=n, when p,=...=q,=0. 


Since 4, is odd with respect to f and g, this is also true for p,. Finally the 
solution q, of the equations 


A,,41:=H,,9,=9, jel; Pi A, 4, =1; 
q,=0 when p,=...=q4,=*X,=0 
is even with respect to f since X, is, but g, has no simple relation to g. 

Now introduce q and p as new coordinates z,f. In terms of these 
coordinates the symplectic form is given by (21.4.4), f is given by (21.4.5), 
and 

g(z, 4) =(zZ, + G(z, ¢), Z, ~Ci5 ¢’) 
for some so far unknown G. Now the fact that g is symplectic gives 
Gi d(— Ci) nd(z, + GZ, Q)=C,dl, adz, 

so df, \dG=0 which means that G is a function of €, only. That g is an 
involution means that G is odd, and G’(0)+0 since the reflection bundles of 
f and g are not the same. We shall now make a final change of the z,C, 
variables to obtain the desired G. In this step the other variables are no 
longer involved so we drop them and write t=z,, t=, to avoid subscripts. 

Put «(t,t)=(tA(z), B(t)). Then xo f = fox if B is odd and A is even, and k 
preserves the symplectic form tdt adt if 


(21.4.9) B(t) B(t) A(t) =1. 
Finally we have x~‘ogox=Qpo, Qo(t,t)=(t+t, —1) if 
goK(t,t)=(t A(t) + G(B(t)), — B(t)) =k o go(t, t) 


=((t+7) A(—1), B(—7)). 
This means that 
G(B(t)) =1 A(t) = t7/B(t) B'(t), 


‘which is easily integrated and gives 
Bit) 
t3/3= § sG(s)ds. 
0 
Since G is odd and G’(0)+0 the right-hand side can be written B*G,(B)/3 
where G, is an even function with G,(0)+0, so the equation becomes 


1=BG,(B)t 


which has a unique solution by the implicit function theorem. We can 
determine A from (21.4.9) with A even and smooth at 0 since B has a simple 
zero. This completes the proof of Theorem 21.4.4. if we take x~'(t,t) as new 
variables. 
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As a first application of Theorem 21.4.4 we shall give a normal form for 
canonical relations with folds. 


Theorem 21.4.5. Let S, and S, be symplectic manifolds of the same dimen- 
sion and with symplectic forms ¢,, ¢,, and let G be a canonical relation from 
S, to S, near (s9,s$)eS, xS,, that is, a Lagrangian submanifold with respect 
to ¢,—0,. Assume that the projections x;: GS, have folds at (s{,s9). Then 
one can choose local symplectic coordinates (x,é) in S, vanishing at s° and 
(y,) in S, vanishing at s$ such that 


G ={0,8y, Ms =m = 201 —y1),2/ = y= 1} 


Proof. Set o=x*oa,=n%0,. By Theorem 21.4.2 this is a.degenerate symplec- 
tic form in G of the type (21.4.3). The reflection bundles for the involutions 
f, and f, in G defined by 2, and z, are different, for they are the kernels of 
dn, and (1,7) is the injection of G in S,xS,. Hence it follows from 
Theorem 21.4.4 that we can choose local coordinates t,,...,t,, 7,,--.,T, in G 
vanishing at (s?,s9) such that 


n 
o=T,dT, Adt, + Qdtjadt;; 


A(D=(t+t 0, -the); fol )=( -1,,7/) 
where t'=(t,...,f,). Hence t,,...,t,, 77, T2,---T, are invariant under f, so 
by Theorem 21.4.2 there are local symplectic coordinates (y,7) at s? such 
that 
(t,t) =(t, 17/2, 7’). 
Similarly t,+1,/2, t,...,t,, Tj, Tg.---5T, are invariant under f, since t,+7, 
+(—1,)/2=t,+7,/2. Since 


g=t,dt,Ard(t, +1/2)+)idtjndt, 


we can also choose symplectic coordinates (x, €) in S, such that 
(t,t) =(t, +7,/2, 0’, t2/2, 7’). 
This means that 
G={(t,+17,/2,t', 07/2, 7; t,,t',17/2,r')} 


which is clearly a Lagrangian with the required properties. Note that the 
canonical relation affects only one pair of symplectic coordinates. 


To motivate the next application of Theorem 21.4.4 we shall sketch the 
situation where it occurs; this gives an important clue to the constructions 
used in the proof. We consider a (second order) hyperbolic equation in X, 
with principal symbol p(x, é). The bicharacteristics of p, that is, the curves in 
the Hamilton foliation of p~'(0), are the light rays of geometrical optics. 
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When they arrive at a surface S, defined by s(x)=0, the reflection law is 
applied. This means that the position (x,é€_)e€T*(X) where the incoming ray 
hits $, or rather T*(X)|5, is replaced by the starting point (x,¢_,) for the 
outgoing ray in such a way that ¢, —¢_ is conormal to S and p(x,¢,)=0 of 
course. Another way of describing this is that we go from €_ to €, along 
the Hamilton foliation of T*(X)|s, that is, the zeros of s. We then follow the 
Hamilton foliation of p until the next encounter with T*(X)|, and so on. 
With this formulation the functions p and s take on surprisingly symmetri- 
cal roles. Some conditions must of course be imposed for the procedure to 
make sense for all rays near a ray tangent at (xo,&)) say - which is the 
really interesting case. The first is that we can find precisely one other zero 
of p(x, €) near €_ in the normal direction (possibly coinciding with €_). This 
is guaranteed if we know that the second order € derivative in the normal 
direction is not 0, that is, {s, {s,p}}+0. The second is that there is some 
good control of how the bicharacteristics of p meet the surface S. The 
simplest situation is where the given ray does not have higher order contact, 
that is, H 3s={p, {p,s}}+0 at (x9,é_). Together with the obvious condition 
that dp and ds are linearly independent these are the conditions which we 
shall impose: 


Definition 21.4.6. Let S be a C® symplectic manifold and F, G two C@ 
hypersurfaces intersecting transversally at s,¢S. Then F and G are said to be 
glancing at s, if the Hamilton foliation of F (of G) is simply tangent to G 
(to F) at So. 


If we choose functions f and g with f=0 on F, g=0 on G and df +0, 
dg+0 at so, the transversal intersection means that df, dg are linearly 
independent at sy, and the other conditions can be written 


(214.10) {Fe}=0, {Li fg}}+#0 and {g,{g,f}}+0 at so. 

Thus we have precisely the situation in the preceding motivating discussion. 

Set J=FG, which is a manifold of codimension 2, and 
K={seJ;{f,g} =0}. 


This is the set where the intersection is not symplectic. K is a manifold of 
codimension 3 for df, dg and d{f,g} are linearly independent at s, since 
H aee g}+0 but H sea rf =0, and H yo, are linearly independent. To 
clarify the geometrical picture we shall introduce local coordinates such that 
the surfaces F, J and K are conveniently located: 


Lemma 21.4.7. There is a neighborhood U of sy in S and local symplectic 
coordinates y, n vanishing at s, such that 
FOU={(y,meU;y,=0}, IAOU={(y,meU; y,=0, 1,=n7}, 
Kou={(y, meu; y, =, =7,=9}. 


312 XXI. Symplectic Geometry 


Proof. Choose local symplectic coordinates x,€ vanishing at sy such that F 
is defined by x, =0 and G by g(x, €)=0 near (0,0). From (21.4.10) it follows 
that 

dg/0E,=0, 8 g/d{+0 at (0,0). 


Since g(0,0)=0 this means that g has a double root at 0, as a function of €,. 
The Malgrange preparation theorem (Theorem 7.5.6) implies the existence of 
C® functions a, b vanishing at 0 and independent of €, and a C® function p 
with p(0,0)+0 such that 


g=p(ei+al,+b)=p((E, +4/2) +b —a’/A). 


Now, using Theorem 21.1.6, we can choose new symplectic coordinates (y, 7) 
vanishing at s, such that 


m=, 44/2, yy=Xy. 
Since 0/éyn,= —H,, = —H,,=0/0&, we have b(x,')—a(x, €')’/4= —c(y,7) 


vi 
where c is independent of y,. In the new coordinates the surface G is 
defined by g,(y,”)=n?—c(y,7')=0, where c(0,0)=0. The linear indepen- 


dence of dy, and dg, gives 


dyiq(0, 0) #0. 


By a symplectic change of the (y’,7’) coordinates we can make c(0, y’,7') one 
of them, so we may assume that c(0,y’,7'/)=y,. Then F,J,K have the form 
described in the lemma. 


Let us now list some simple but important consequences of the lemma. 
Choose U as a ball in the y,4 coordinates, and set 


i-ym=(,-non), Ometau 


with the notation in the lemma. Since the curves in the Hamilton foliation 
of F are just the parallels of the €, axis it follows from the lemma that (y, ) 
and i,(y, 4) are the only points in J on the same curve in this foliation; they 
are different if (y,n)€K. Hence the C® involution i, of J with fixed point set 
K is invariantly defined, without reference to the local coordinates. It 
expresses the construction in the introductory motivation above. The re- 
flection bundle of i, is generated by H, since this is 6/@y, in the local 
coordinates. The restriction of the symplectic form to J is 

n—-1 
d dn; Ady, +2n, dn, dy, 
so it satisfies the hypotheses in Theorem 21.1.7, with n replaced by n—1. 
Finally, if weC°(U) then H, is tangential to J at sy if and only if dy 
vanishes in the direction H, and in the direction of the radical of the 
symplectic form restricted to K. In fact, these conditions mean respectively 
that éw/éy, =0 and that dy/dy,=0, and the tangent plane of J is defined by 
dy, =dn,,=0 at (0,0), so H,, is tangential when 0y/dn, =0y/dy, =0. 
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We are now prepared for the proof that all pairs of glancing hyper- 
surfaces are equivalent: 


Theorem 21.4.8. Let S be a C® symplectic manifold and F, G two C® 
hypersurfaces in S with a transversal glancing intersection at s,éS. Then there 
are local symplectic coordinates (x, €) vanishing at s, such that F is defined by 
x,=0 and G is defined by €? —x, —€,=0. 


In particular, the dimension 2n of S is at least 4. That n>1 is im- 
mediately clear since {f,g}=o(H,,H,)=0 means that H, and H, are pro- 
portional if n=1. 


Proof of Theorem 21.4.8. The manifold J with the restriction o|, of the 
symplectic form and the involutions i,, ig - defined as i, - satisfy the 
hypotheses of Theorem 21.4.4 by the preceding remarks. Hence we can 
choose local coordinates f,,...,t,_1,74,.--5T,-, in J at sg so that 


n—-1 
o|;=2t,dt, Adt,+ ¥ dt; dt,, 
2 


ip(t, t=(t, Ty T a5 erey Ty D> 


ig(t, T) =(€, +274, by, ---5tg ts —T 19 T 25 -0ey Te) 


We have inserted factors 2 here to get simpler formulas below. This is 
permissible since Theorem 21.4.4 shows the equivalence with the old model 
which is also immediately verified by a change of variables. The theorem 
states that there are local coordinates (x, €) in S such that 


n—1 
J = {0X95 00-5 Xo bts Sn ED o|y=20, de, Adx,+ Y do, Adx;, 
2 


ip(O, X95 0065 Xo Ey 009 Ey 19 O 7) =(0, X95 02 X pp —&,, Enyce En Ea) 
Ig(O, X25 0-5 X yo E4909 En 4 E27) =(0,x2,...,X, +2, —€1,E, wees En tp &%). 


In fact, the Hamilton vector field defined by x, is 6/0€, and that of &? 
—x,—&, is 2&,0/Ax, —0/0x, + O/0€ ,, 80 X94 02-)X__ ps ban eer Eg_ pp CU —X =E, 
and €,+x, are constant on the curves of the Hamilton foliation of G. 
Introduce now the local coordinates y,y of Lemma 21.4.7 which make 
F,J,i, and K have the desired form. Thus J is regarded as a subset of 
T*(R’), 
F= {0,925 0-25 Vue May 0 Mn ve ID} OU 


parametrized by y2,...,VasM%1>-+-2%n—1- We define our new coordinates x, € 
first on J by 


(214.11) -x,=t, €=0) 1<j<n;  x,=0, €,=1,, x,=0,, €,=17. 
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where t,t are regarded as functions of y.,...,V,.1%1,-+-»%,- Lhen we have 
n 

(21.4.12) Yidé;adx,=ol|, on J. 
2 


We must now extend the functions ¢; and x, from J to a neighborhood of 
(0,0) so that we obtain a symplectic coordinate system with x,=0 on F and 
&2—x, -&,=0 on G. 

The map 


Ja(0,y2, seo Vno M15 iaxs Ng cas Hp (a5 =++9 Vas 42> nastneaata) 


is a fold map with involution i,, and since x,,...,x,, €,...,€, are even 
under i, there exist functions %,,...,%,, €),-..,€, of (y’,) in a neighborhood 
of 0 which restrict to x,...,x,, &,-..,€, on J if we regard them as 
functions in F = {(y,7); y, =0} independent of 7,. From (21.4.12) we obtain 


(21.4.12) Yidé,ad&;=Yidnjvdy,; in F 
2 2 


in the set 4,>0 where & , and X, are uniquely determined. As in the proof of 
Theorem 21.4.2 we can then change the definition when y,<0 so that 
Koy ivn'g Xing Copco. become symplectic coordinates in T*(IR"~') in a full 
neighborhood of 0. Set X, =0, still when y, =0. 

To extend the functions X,, e; to a full neighborhood of 0 we observe 
that x,+é, is a function on J which is invariant under ig. Hence the 
preceding argument with F and G interchanged shows that there is a C” 
extension Q to a neighborhood of 0,0 which is constant along the foliation 
of G. Thus 


(21.4.13) Q=x,+é, on J. 


The derivative of Q along the radical of o|, is not 0 because of the term x, 
so Hg is tangential to G but not to J, and G is near (0,0) the Hg flowout of 
J. \t follows also that Hg is transversal to F, for J=F OG. We can therefore 
define x,,...,X,, €5,---,€, uniquely near 0 by 


(21.414) Hgx,;=0, j=2,....n, Hgé,=0, k=2,...,.n—-1, 
Hox,=1, Hgf,=—-1, 

with initial values given by %,,€, on F. Then we obtain 
(21.4.15) {oj Xb =Ojn,, {0),6 7} =9, 

{x,, X}=0; f,f=2,...,n; kK =1,...n. 
By the Jacobi identity and (21.4.14) the Poisson brackets of these quantities 
and Q are equal to 0 so it is sufficient to verify (21.4.15) on F. There the 
derivatives with respect to y, all vanish so the Poisson brackets reduce to 


Poisson brackets in the variables y,,...,y,, %2,.---.4, Which satisfy the re- 
lations (21.4.15) in view of (21.4.12) and the fact that x,=0. 
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Now define the extension of €, by 
(21.4.13)' f1=Q-x, 


which is legitimate by (21.4.13). Then the remaining commutation relations 
{61,6} =0, {O1.%b =O14, ik=1,...,n 


follow from (21.4.14). It remains to verify that €?—x,—&, vanishes on G, 
which so far we know on J by (21.4.11). Since G is the Hg flowout of J it 
suffices to verify that 


Hy (ej —x,—¢,)={¢, +X 424 —x,—¢,}=0 


which follows from the commutation relations. The proof is complete. 


In some approaches to the study of diffractive boundary problems one 
needs the analogues of Theorems 21.4.5 and 21.4.8 in the homogeneous case. 
(See Section 24.4 for other methods.) To prove them one must reexamine 
the preceding theorems from Theorem 21.4.3 on. 


Theorem 21.4.9. Assume that S and oc satisfy the hypotheses of 
Theorem 21.1.10 and that i is a homogeneous C® involution with i*o=o and 
fixed point set Sy. Then there exist local coordinates with the properties 
stated in Theorem 21.1.10 such that the involution only changes the sign of the 
&, coordinate. 


- Proof. We can follow the proof of Theorem 21.4.3 closely. By the proof of 
Theorem C.4.5 there are local coordinates s,,...,5,, in a conic neighborhood 
of any s,€S_ such that s, is homogeneous of degree 1 and even with respect 
to i for j+2n while s,, is homogeneous of degree 4 and odd. By Theo- 
rem 21.4.3 the Hamilton field of s3,/2 is smooth and is a non-zero element 
in the radical of o at sy, hence non-radial. As in the proof of Theorem 21.4.3 
we have o=f*é where f=(S,,...,5,_ 1353,/2) and & is a closed non-degen- 
erate homogeneous two form in U,={teU;t,,20} where U is a conic 
neighborhood of f(s,). Thus ¢=d@ if @(.)=6(0/dt,.), for 0/ét is the radical 
vector field. A homogeneous extension of the one form @ gives a homo- 
geneous closed non-degenerate extension of é to a full conic neighborhood 
U’ of f(s). Since 

F* {tay U5 ={93,/2,f *u}, 


is not 0 at s, for all homogeneous u vanishing at f(s,), the Hamilton field of 
t,, is not radial. By the homogeneous Darboux Theorem 21.1.9 we can 
therefore choose homogeneous symplectic coordinates y, 7 in U’ such that 
Ny =l2,, Then x,=f*y,, ¢,=f*n, for k#1, €;=s2,, have the required 
properties since f*4, =s3,,/2=€7/2. This completes the proof, which is inde- 
pendent of Theorem 21.1.10. 
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The analogue of Theorem 21.44 in the homogeneous case is more com- 
plicated both to state and to prove: 


Theorem 21.4.10. Let S be a conic manifold of dimension 2n with a two form o 
which is homogeneous of degree 1 and _ satisfies the hypotheses in 
Theorem 21.1.7. Let f and g be two homogeneous C® involutions of S with 
f*o=o, g*o=a, having S, as fixed point set, and assume that their reflection 
bundles at y, and the radial vector p(y) are linearly independent. Then n=2 
and in a conic neighborhood of y, one can choose coordinates (x,€) homo- 
geneous of degree 0 and | respectively and equal to (0,¢,) at yo, such that 


(21.4.16) o=d(E3/é,) Adx, +3 ae, rds, 
2 


(21.4.17) ; f(x, C)=(x, ess Go isets Gas 
a(x, €)=(x%, +2E,/E,,X9, ---,X_ —$(E,/E,)’, —€45E35 0-5 E,) 


The preceding normal form really comes from the intersection of the 
glancing hypersurfaces at the end of Section21.1; we shall return to this 
point in Theorem 21.4.12. 


Proof. As a first step we show that a, f| g as given in (21.4.16), (21.4.17) 
satisfy all the hypotheses. To do so we observe that the reflection bundles 
are generated by (0,¢,) and (—e,,¢,¢,), hence independent of (0,¢,). That 
f*o=c is clear and also that we have involutions. Finally g*o=o because 


ASt/E) A d(2E 1/E,) 48, 4 dB (E4/6,)°) 
= 28t/Er dE, A E/E) —2(E 1/6 ,)° 46, A AE 1/6) =0. 


Theorem C.4.8 in the appendix really says that all conic manifolds with 
homogeneous involutions satisfying the hypotheses are equivalent. By Theo- 
rem 21.4.4 the radical of the symplectic form o restricted to S, is in the 
linear span of the reflection bundles, so the hypotheses of Theorem 21.1.10 
are also fulfilled. By Theorems 21.4.9 and Theorem C.4.8 in the appendix we 
can choose homogeneous coordinates x, € such that o and f have the 
desired form, and also homogeneous coordinates y, 4 such that f and g 
have the desired form; both sets of coordinates are equal to 0,¢, at yo. If we 
think of &,(2/&,)* as a variable instead of €, we see that the situation is 
essentially the same as in the proof of Theorem 21.4.4. One difference is that 
y, i$ not invariant with respect to g, but y,+n7y,/3n? is. With o jj Wj as in 
the proof of Theorem 21.4.4 we therefore set 


% = 90,y*,0,7), § =4j0,y*,0,n'), 7 #1, 


where y* =(V5,..-5Vn—15¥n +94 93/302). Then X, and og are invariant under f 
and g. We define p,, q,,...,p, aS before but postpone the choice of q, since 
the Hamilton field would become radial (see also the proof of Theo- 
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rem 21.1.9). Instead we determine p, using the commutation relations with 
P25--->P, and the boundary condition p,=y4, when p,=...=—q,_ ,=X,=0. 
Next q, is determined with g,=0 when p,=...=q,_,=%,=x,=0. Finally 
q, is obtained by integrating the commutation relations with boundary 
conditions where py=q,=...=q,_,=X,=0. The coordinates p,, q; auto- 
matically get the required homogeneity because of the homogeneity of the 
equations and the boundary conditions posed. However, in the choice of the 
last two coordinates q, and q, the boundary conditions were not invariant 
under g so we have no information how they are related to g. 

If we take q and p as new local coordinates, changing notation so that 
they become x, é, then (21.4.16) holds, f is given by (21.4.17), and 


g(x, Ey=(x, +(x, é), Xa> crea Xn dy X, + w(x, é), = Cis Cs, wees é,) 
for some v, w homogeneous of degree 0. Since g preserves o we have 


(21.4.18) d(é2/E,) \dv+dé, Adw=0 


which implies that v and w can only depend on the variables ¢, and ¢,. The 
others no longer play any role so we simplify notation by assuming n=2. 
We can write v(&)=V(t), w(é)=Wt) where €,=t&,, which simplifies 
(21.4.18) to t?d&, dV +dé, ndW =0, that is, 
(21.4.18)’ t?V(t)+ W(t) =0. 
That g is an involution means that V and W are odd functions of t. Note 
that V(t)=2t implies W(t)= —2t3/3 by (21.4.18) since W(0)=0, which ex- 
plains the look of (21.4.17) and shows that it suffices to make V(t)=2t. 

As at the end of the proof of Theorem 21.4.4 we make a simple change 
of variables by introducing 


K(x, ¢) = (x, S(t), X2+%17(0,6,R(0, 62), t= 04/62. 
Then x*o=o if 
d(é,R*) Ad(x,S)+dé, Ad(x,T)=d(E,t7) adx,, 
which means explicitly, if we write U=R?, that 
(21.4.19) US+T=t?, US'+T'=0,  U'S=2t. 


The middle equation follows from the other two; it is clear that S and T 
can be determined as even functions with S(0)+0 satisfying (21.4.19) by 


S=2t/U', T=t?-21u/U' 


if R is chosen odd with a simple zero at 0. We then have a diffeomorphism 
xk. Now kx commutes with f under these conditions and 


goK(x, €)=(x,S+V(R),x,+x,T+W(R), —€,R,€,). 
If g, is defined as g with V and W replaced by V, and W, then 
Ko gi(x, )=(X, +V,)S, x, +W, +(x, 4+Vi)T -o2R, 62), 
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SO goK=Kog, if 
V,S=V(R), W,+V,T=W(R). 


In particular, V,(t)=2t if 


V(R)=2tS(t)=2t7/R(t) R'(0, 
that is, 


2103/3 =| V(s)s ds. 


Since V’(0)+0 because the reflection bundles are different, this determines R 
as a function of t with a simple zero at 0 (see the proof of Theorem 21.4.4), 
so we have obtained a change of variables giving o, f and g the desired 
form. The proof is complete. 


In the applications of Theorem 21.4.10 which follow the important point 
is not the explicit normal form given but that all S, o, f| g satisfying the 
hypothesis are equivalent. We can therefore use any other more convenient 
model. 

We come now to the homogeneous version of Theorem 21.4.5. The 
normal form for a homogeneous canonical relation from T*(R")~0 to 
T*(R")~.0 which we shail use is perhaps best described by means of the 
phase function 


(21.4.20) (X,Y, 8, C= Kx —y, E> +88, —8°E,/35 
x, y,€ER", €,>0, seR. 
The equation ¢,=@,=0 means that 
x,—-y,+s=0;  x,—y,—5°9/3=0;  xj=y;, 1<j<n;  €,=s76,. 
The corresponding canonical relation is 
(214.21) G={(x,E ym); 1-7 b = E15 3 XW +1 — 1)? =, 
C=, X;=yj, 1<j<n, €,>0}. 
With the parameters x,s,é’=(€,,...,€,) we have 
(2.4.21) = G={(x,87,, &',X4 +S, X95 0065 Xp~_ 1) Xp, 57/3, 87 E,, ©); 
n> 0} 


which shows that G is a 2n dimensional manifold. In view of the symmetry 
between x and y it follows that we have fold maps to each factor 
T*(IR")\0, the range being defined by €,20 and 4,20 respectively. The 
kernel of the differential of the map when s=0 is the y, direction and x, 
direction respectively. With the coordinates in (21.4.21) the corresponding 
involutions are s++—s and st>—s, x,+>x,+2s, x,t0x, —2s7/3 respectively, 
and the pullback of the symplectic form to G from either copy of T*(IR”) is 


d(s?&,)Adx, +) dé, adx;. 
2 
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Theorem 21.4.11. Let S, and S, be 2n dimensional conic symplectic manifolds 
with symplectic forms Cee and let G be a homogeneous canonical relation 
from S, to S, near (s°,s$)eS,xS,. Assume that the projections 1;; GS, 
have folds at (59, s$) and that the caninical one forms w, and Or in S, and Ss; 
lifted to G do not both vanish on the tangent plane of G at (s°,s9). Then n>? 
and we can choose local homogeneous symplectic coordinates (x,€) in S, and 
(y,7) in Sz, equal to (0,€,) at s? and s9, so that G is locally given by (21.4.21). 


Proof. The radial vector p at (s°,s$) is not the sum p,+/, of tangent vectors 
to G with zero component in S, and S, respectively, for by hypothesis at 
least one of the components of p in T, (Sy ) and T(o(S,) is not in the 
Lagrangian tangent plane of G. The hypotheses of Theorem 21.4.10 are 
therefore fulfilled by G, o,=x*o,=7%o,, and the involutions f,, f, defined 
by 7,, ™. In particular, this is true for the model we are aiming for, so we 
can choose local coordinates t,,...,f,,5, T2,...,T, in G, with t,,...,t,,5 ho- 
mogeneous of degree 0 and 1,,...,t, homogeneous of degree 1, t,=1 and 
the other coordinates 0 at (s°, 9) so that 


Gg=a(s*t,) Adt, +) dt, adt,, 
2 


filt,s, 0) =(t, -—s,0'),  fi(t,s, cv) =(t, +25, 05, ...5t,_ 1st, —259/3, —8,7’). 
Then there are C® functions x, € in S, and y, 7 in S,, homogeneous of 
degree 0 and 1 as usual, such that 

miXj=tj apo =nin=t, J+, att, =nin,=s"t,, 
mhy,=t; jf=2,..,.n-1, my =t,+s, mty,=t,—s°/3. 
They form local homogeneous symplectic coordinates in a neighborhood of 
(0, é,) when €, 2 0 and 7 2 0 respectively. As in the proof of Theorem 21.4.2 
they have unique modifications when €, <0, 7, <0 to symplectic coordinates 
in a full neighborhood of (0,¢,) with x, and y, unchanged. These are 


automatically homogeneous because of the uniqueness. The proof is com- 
plete. 


Finally we shall prove the homogeneous form of the equivalence of 
glancing hypersurfaces. Here we take the model surfaces in Example 21.1.11, 


F = {(x, €2)E T*(R")\0, x, =0}, 
G={(x, JeT*(R,€,_ 1. >0,65=x16F 146, 16}, 1 >2. 
The intersection is transversal, 
FOG =F ={(0,x 61) Sn St/Sn— 1} 
is parametrized by x,,...,x,,&;,---,¢,.1 and the restriction of the symplec- 


tic form to J is : 


n- 


ol, =d(Et/é,_ 1) A dx, + » de; Aax;. 
2 
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The Hamilton foliation of F consists of the lines parallel to the €, axis so 
the corresponding involution i, just changes the sign of €,. Since 


Casta on eadee eee eee 


x, +6,/€,-4 and x, 1 —(€,/€,-1)°/3 +416 1/n—1 


are constant on the Hamilton foliation of G we find that the corresponding 
involution ig changes ¢, to —€,, x,_, to x,_,—2(&,/E,_ ,)°/3 and x, to x, 
+2€,/€,_, while leaving the other coordinates unchanged. The reflection 
bundles when €, =0 are given by (0,¢,) and (—e,,€,_,8,), and since €,_ , +0 
they are linearly independent of the radial vector (0, €).. 


Theorem 21.4.12. Let S be a conic symplectic manifold of dimension 2n=6 and 
symplectic form o, and let F, G be two conic C® hypersurfaces in S with a 
transversal glancing intersection at syéS. Assume that the tangents of the 
foliations of F and G at sy and the radial direction are linearly independent. 
Then there are local homogeneous symplectic coordinates (x,&) equal to 
(0,6, 4) at So, such that F is defined by x,=0 and G is defined by 
Et =x, soar Gy Cats 


Proof. All the hypotheses of Theorem 21.4.10 are fulfilled by J=F OG with 
the form o|, and the involutions i,, ig defined by the Hamilton foliations of 
F and G. This proves that n—122 and that we can find local coordinates 
ty,---,t,-1, Ty,++:3T,-1 In J, homogeneous of degree 0 and 1 respectively, 
with t,_,=1 and the others 0 at sy, so that 


n-1 
o|,;=d(t7/t,_ 4) Adt,+ » dt,adt,; 


ip changes the sign of t,, and i, changes the sign of t, and replaces t, by 
t,+20,/t,45 ty bY ty —2(t,/t,_ 1)°/3. The proof of Lemma 21.4.7 shows 
that we can find homogeneous symplectic coordinates y, 7 in S, equal to 0, 
&,_, at Sg, such that F is defined by y, =O and 


(21.4.22) F= {QVM 05M 1 91/Mn— 135 


hence i, has the desired form. In fact, the proof of Lemma 21.4.7 works with 
no change until we have obtained a coordinate system where F is defined 
by y, =0 and G by ni —c(y,n')=0 where c is homogeneous of degree 2 and 
d,,,,¢ is not proportional to dy,_, at (0,¢,_). Set co(y’,4)=c(0, y',7’). We 
can choose $(y’,') homogeneous of degree 1 with {cy,@}=0 and ¢=1 at 
(0,¢,_,), for H,, does not have the radial direction so we can put homo- 
geneous initial data on a conic surface transversal to H,,. Then c,/@ is 
homogeneous of degree 1 and {c,/¢, 6} =0, so we can use Theorem 21.1.9 to 
find a homogeneous symplectic coordinate system z,,...,2,, €5,.--5¢€, with 
C,-1=@ and ¢,=c,/d. Changing notation from 2’,f’ to y,7' we have 
achieved (21.4.22). 
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We can now define our new coordinates x, € on J by (21.4.11) except 
that €&, = 7; 2/T,—1. This makes (21.4.12) valid and we can find extensions 
R15 -0eyXny &,---€, to F as before. The definition of Q in (21.4.13) must be 
changed; the terms have different homogeneities. However, x,€,_,+€, is 
now invariant under ig, homogeneous of degree 1, and has the same 
differential as x,+€, at (0,¢,_,). We can therefore find a C® extension Q 
which is homogeneous of degree 1 and constant on the G foliation so that 


(21.4.13y Q=x,f,.,+6, onJ.. 


Hg is still transversal to F, so we can extend &,,...,%,, €5,...,€, by solving 
the differential equations 
(21.4.14) Hox,=0, j+1, 1-1; Hox,=1, Hgx,_1=%,; 

Hoé;=0, jtin, Hoé,=—E,-1 
with initial values X,, ip on F. With €,=Q-—x,é,_, we then have the 
desired coordinate system as before. 


21.5. Symplectic Equivalence of Quadratic Forms 


The general purpose in this section is to study the simplest singularities of a 
Hamiltonian vector field. Recall that a vector field v in a manifold X is said 
to have a singularity at xeX if v>=0 at x; then 


prod(vd)(x), peC*(X), 
defines a linear map T,* > T* with adjoint V: T+ T, defined by 


<Vt,dp>=<t,d(vd)> if peC?(X), teT,. 


In local coordinates the matrix of V is of course (6v,/0x,);4_1,..... We shall 
use this invariant concept for Hamiltonian vector fields: 


Definition 21.5.1. If S is a symplectic manifold and feC7(S) is real valued, 
df =0 at syéS, then the linear map F in T,,(S) defined by H ,/2 will be called 
the Hamilton map of f. 


Some authors use the term “fundamental matrix” which seems awkward 
when one wants to have an invariant setup. The factor 4 is just a traditional 
convention which will be seen to be convenient. The Hamilton map is of 
course determined by the Hessian f” of f at s,, so no generality is lost if 
we assume that S is a real symplectic vector space and that f is a real 
quadratic form Q. The definition of F (at 0) is then that for X eS and linear 
functions ¢ 


(FX, $) =X, H99/2)= —{X, H4Q/2>= —{X, O(., Hg)> = — Q(X, Hy) 
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where Q(X, Y) denotes the symmetric bilinear polarized form of Q, that is, 
Q(X, Y)=Q(Y, X) and Q(X, X)=Q(X). The left-hand side is o(FX, H,4) so the 
definition of F is equivalent to 


(21.5.1) o(¥,FX)=Q(Y,X);  X,YeS. 


(There is no factor 2 here thanks to the factor 4 in the definition.) In 


symplectic coordinates (x,¢) the matrix of F becomes “the fundamental 
matrix” 


(21.5.1 2-1 (_ 


The symmetry of Q means that 
(21.5.2) o(FX,Y)= —o(X,FY), 


that is, F is skew symmetric with respect to o. Note that the preceding 
definition is still applicable if Q is complex valued provided we replace S by 
its complexification 


Se={X +iY;X, YeS} 


with the obvious complex symplectic structure. To study the symplectic 
classification of quadratic forms by means of the spectral decomposition of 
the Hamilton map F we must anyway make this extension even if Q is real. 
Let V, denote the space of generalized eigenvectors of F in Sg belonging to 
‘the eigenvalue AEC. 


Lemma 21.5.2. o(V,, V,)=0 if A+u+0. 


Proof. F + is a bijection on V, when 2+ +0, and 
o(F +p)" V,, V,)=o(V,,(—F +p)" V,) =0 


if N is large enough. 


From the lemma it follows that V, is the dual of V_, with respect to the 
symplectic form. Thus V, and V,@V_,, 4+0 are symplectic vector spaces, 
orthogonal with respect to o and invariant under F, hence Q orthogonal 
too. We shall use these observations to determine the structure of F and Q 
in the cases of interest for us, namely when Q is real and positive semi- 
definite or hyperbolic in the sense that the negative index of inertia is 1. In 
both cases Q20 in a hyperplane, so Q cannot be <0 in a two dimensional 
plane which only meets 


RadQ={X;Q(X, Y)=0 for all Y}=KerF cv, 


at the origin. If ReA+0 it follows from Lemma 21.5.2 that Q vanishes on 
Re V,<V,+V, so the dimension of this space must be at most 1. Hence 4 
must be real and V, one dimensional if V, is non-trivial. Choose a real basis 
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vector e, for V, then and a dual one ¢, for V_, with a(e,,e,)=1. Then Fe, 
=e, Fe, = —Ae,, and Q(e,, €;)=Q(E,,€,)=9, Qle;,e,)=A, so 


O(x,e, +, 8,)=24x, €). 
Assume now that A=iy, p>0O and that FX =iuX, X=X,+ixX,+0. Then 


Q(X, X)=0(X, FX)=o(X, ipX) 
so 
—2pyo(X ,,X,)=O(X, X)=Q(X ,) + O(X,), 
Q(t, X,+t,X)=a(t,X,+t,X,, F(t, X,+t,X,)) 
=o(t,X,+t,X, —tyuX,+t,uX ,) 
= —po(X ,,X2)(t7 +t). 


Since Q cannot be <0 in a two dimensional plane not meeting the radical 
we conclude that o(X,,X,)<0 so we have an invariant symplectic two 
plane. Normalizing so that o(X,,X,)=—1 we can take X,, X, as basis 
vectors there in a symplectic coordinate system, which makes Q equal to u 
times the Euclidean form, and can then pass to examining the o and Q 
orthogonal complement. Repeating this argument we find that S is the 
direct sum of symplectically and Q orthogonal subspaces where Q is given 
by u(x7 +7), w>O, or 2Ax,é;, A>0, and the real part Vj of Vy. It only 
remains to study Vj, so we assume from now on that S=V,, that is, that F 
is nilpotent. 
If one can find X eS with 


F?X=0, Q(X)=0(X,FX)+0, 
then X, FX span a two dimensional F invariant symplectic space where 
QO(t,X +t,FX)=o(t,X +t, FX,t,FX)=t7 Q(X), 


and we can pass to studying the o orthogonal complement. Such a re- 
duction is also possible if KerF contains two elements X and Y with 
o(X, Y)+0. What remains is therefore to study the situation where 

a) Ker F is isotropic 

b) F?X =0 > Q(X)=0. 

By our hypothesis b) implies that Ker F?/Ker F has dimension at most 
one. In the exact sequence 


O0—— Ker F —— Ker F? £> Ker Fn Im F —>0 
we have Im F =(Ker F)’>Ker F by a), so 
0—— Ker F—— Ker F?*> Ker F —>0 


is exact and KerF has dimension 1. It follows now from the Jordan 
canonical form of F that there is a number N and an element X such that S 
has the basis X,FX,...,F’-1X while F’X =0. The dimension N must be 
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even and >2. Now 
OFX, FEX)=(—1f o(X, Ft 1X) 


vanishes if /+k+12N, so Q vanishes in the two dimensional space spanned 
by FX-3X, FN-?X if 2(N—3)+12N, and it does not meet Ker F. Hence 
2(N —2)<N, that is, N=4. We must also have o(X,F*X)<0 since Q <0 in 
the two dimensional space spanned by FX and F?X otherwise. We normal- 
ize so that o(X, F?X)= —1 and set Y=X +tF*X. Then o(Y, FY)=o(X, FX) 
—2t=0 if t=o(X, FX)/2. A symplectic basis is thus given by 


e,=—-F?Y¥, e,=—-FY, ¢,=Y, @,=F’Y. 


With the corresponding symplectic coordinates we have Q=x3—2£,¢é,. We 
sum up our conclusions as follows: 


Theorem 21.5.3. If Q is a positive semi-definite quadratic form in the symplec- 
tic vector space S of dimension 2n then one can choose symplectic coordinates 


x,& such that 
k+l 


k 
(21.5.3) Ql )=Lajeg+ei)+ Y x7. 
1 k+1. 


where u,>0. If Q is hyperbolic, that is, has negative index of inertia equal to 
1, then the symplectic coordinates can be chosen so that 


k k+l 
(21.5.3) Ox, H=V ulx7+of)+ Y x7 +(x, 8), 
i k+i 
where ,;>0 and either k+1<n—1 and 
(21.5.4) q(x, C)= x, - 26,1, 
or k+1<nand 
(21.5.5) g(x,2)=—x2 or q(x,2)=2Ax,€,,  A>0. 


The cases listed are of course all inequivalent; iu, are the eigenvalues of 
F on the positive imaginary axis, counted with multiplicity, (21.5.4) occurs 
when there is a 4x4 box in the Jordan decomposition of F, and the second 
case (21.5.5) means that A is an eigenvalue of F. 

We shall now give a modified form of the result when .Q is complex 
valued and ReQ is positive definite. More generally the results in the semi- 
definite case have analogues when 


(21.5.6) [Im O(X)|S C ReQ(X), XeS, 
as we shall now assume. Set 
(21.5.7) T={weC; |imwi<C Rew}, 


which is then a convex angular region containing the values of Q. We could 
use any such angle but (21.5.7) is convenient in the proof. 
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Theorem 21.5.4. If (21.5.6) is valid, then 
a) O(X, X)=02 FX =0=F ReX=FImX=0=FX=0 so KerF is’ 

spanned by its real elements. 
b) F?V,=0, that is, 


XEV) Q(X, Y)=0 when o(KerF, Y)=0. 
c) Afiel or Afie—T if 4 is an eigenvalue of F. 


Proof. Write X = X,+iX, with real X,. Then 


Q(X, X)=Q(X ,) + O(X2) 


so (21.5.6) shows that Q(X, X)=0 implies Re Q(X ,)=0, hence Im Q(X ,)=0. 
If ze is so close to 1 that RezQ is positive semidefinite, it follows that 


RezQ(Y,X)=0, YeS, 


hence Q(Y, X )=o(Y, FX )=0 so FX ,=0. The other implications in a) are 
obvious. To prove b) assume that XeSq and that F?X =0. Then FF? X =0 
by a) and 


QO(FX, FX)=0(FX, F?X)= —a(X, FF?X)=0 


so F*X =0 by a) again. Since o(Ker F, Y)=0<> Y=FZ for some Z, we have 
Q(X, Y)=o(X,FY)=0 for all such Y if and only if F?X =O. Finally c) 
follows since FX =AX, X =X ,+iX,, implies 


Q(X ,)+ Q(X 2) = Q(X, X)=0(X, FX) =10(X, X)=2id o(X ,, X)). 

Here o(X,, X,) is real and not 0 unless FX =0 which proves c). 

Let W=SonKerF and set S'=W°/((WOAW’) which is a new symplectic 
space. Then 

Sc=Im F/(Ker FoImF)= @V, 
A+O 

by Theorem 21.5.4 a) and b), and for the quadratic form Q’ induced by Q in 
S’ the Hamilton map is isomorphic to the restriction of F to the eigenspaces 


with eigenvalues +0. When studying these further we may. therefore assume 
that ReQ is positive definite. 


Definition 21.5.5. A complex Lagrangian plane A<Sg is called positive if 
(21.5.8) io(X,X)20,  Xed. 
If io(X, X)>0 when 04 X eA we say that A is strictly positive. 


Note that io(Y, X) is a hermitian symmetric form since 


—io(Y, X)=ia(X, Y). 
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Theorem 21.5.6. If Re Q is positive definite, then 
ST=@V, 


Aeil 


is a strictly positive Lagrangian plane, invariant under F. 


Proof. That S* is Lagrangian follows from Lemma 21.5.2, and the invariance 
under F is clear. To prove that S* is positive we first assume that Q is real. 
Then we can choose symplectic coordinates so that 


Q(x, )=V uj(x7 + €}). 

Thus F(x, ¢)=(u1,---, —H1%1,---) 80 S* is defined by €,=ix, and 

o(,€;x, 2)= —2iy |x? 
which proves the positivity. Returning to the general case we first observe 
that if S* is positive and io(X,X)=0 for some XeS*, then io(X, Y)=0, 
YeS*, by Schwarz’ inequality. Taking Y=FX we obtain Q(X, X)=0, hence 
X=0 so S* is strictly positive. The general proof follows now by a con- 
tinuity argument. Set 


O,(X)= Re Q(x)+itImQ(X), XeS, OStSl. 


The Lagrangian plane S;* defined by Q, varies continuously with t since the 
projection of Sp on S;* is given by 


(2ni)-* | (R—z)"'dz 


where F, is the Hamilton map of Q, and y is a circle in the upper half plane 
containing the eigenvalues of F, there in its interior. Let T be the supremum 
of all te[0,1] for which S; is strictly positive. Then Sf is positive for 
reasons of continuity, hence strictly positive as we proved above. This 
proves that T=1 and completes the proof. 


Remark. Since 
io(X, FX) =iQ(X, X)eil 
the numerical range of the restriction of F to S*, with respect to the 
positive definite hermitian form there, is contained in if. This implies that 
all eigenvalues are in if but is a much stronger statement. 
A strictly positive Lagrangian plane gives S additional structure: 


Proposition 21.5.7. If S* Sg is a strictly positive Lagrangian plane then 
StaXt>ReXeS is a bijection giving S a complex structure. There is a 
unique positive definite hermitian form H in S with respect to this structure 


such that 
Im H(X,Y)=—o(X,Y); X,YeS; 


and ReH gives a natural Euclidean structure in S. 
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Proof. If XeS* then 
0<io(X,X)=20(ImX,ReX), X+0, 


which proves that StsXt->ReX eS is injective, hence bijective since the real 
dimensions are equal. The hermitian form H we look for may be regarded 
as one on St, and then we must have if XeS* 


A(X,iX)=ilm H(X,iX)= —io(Re X, ReiX)=io(Re X,Im X), 
hence _ 
H(X, X)=o(Im X, Re X)=ia(X, X)/2. 


Polarization gives 
(21.5.9) H(X, Y)=io(Y, X)/2 


and proves the theorem since the argument can be reversed. 


One can use Proposition 21.5.7 to choose symplectic coordinate systems 
which are well adapted to a strictly positive Lagrangian plane: 


Corollary 21.5.8. Let S* be a strictly positive Lagrangian plane CS¢ and let A 
<S be Lagrangian. Then one can choose symplectic coordinates x,€ in S such 
that 4 is defined by x =0 and S* = {(x, ix); xeC"}. 


Proof. The hermitian form H in Theorem 21.5.7 is real in A. We can 
therefore choose a real orthonormal basis ¢,,...,¢, in A. Choose X jE5 SO 
that ¢,—iX ,eS*. The definition (21.5.9) of H means that then 


io(e,+iX ;, 6, —-iX,)/2=6;,, 


that is, o(X,,X,)=0 and o(¢,, X,)+o(é,X )=26,,. Since S* is Lagrangian 
we also have o(¢;—iX,,e,—iX,)=0, hence o(é,, X,)=o(¢,, X;) which proves 
that X,,...,X,, €1,--.5&, iS a symplectic basis for S and proves the corollary. 

When we apply the corollary to the space S* in Theorem 21.5.6 we 
conclude that Q(x,ix)=0 with the coordinates chosen. Thus Q is in the ideal 
generated by x,+i¢,, j=1,...,0. 

A strictly positive Lagrangian plane has a very simple representation 
also for arbitrary symplectic coordinates: 


Proposition 21.5.9. Let 4<T*(€") be a strictly positive Lagrangian plane. 
Then there is a symmetric matrix A=A,+iA,. where A, is real and A, is 
positive definite, such that A= {(z, Az); zeC"}. Conversely, every such’ plane is 
strictly positive. 


Proof. From Definition 21.5.5 it follows at once that An T(C”) = {0}. Thus / 
is the graph of the differential of a quadratic form, that is, A= {(z, Az), zeC"} 
where A is a symmetric matrix. If d=A,+iA, with A, and A, real, we 
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have = 
ia ((Z, Az), (z, Az))=2¢A,2,2> 


which proves the proposition. 


The structure of an arbitrary positive Lagrangian plane is clarified by 
the following result. 


Proposition 21.5.10. If 4 <S¢ is a positive Lagrangian plane and Xe, o(X, X) 
=0, then XeA, hence ReXed and ImXe/. 


Proof. By Cauchy-Schwarz’ inequality we have io(X, Y)=0 for all Yea. 
Since 1 is Lagrangian this implies that Xe/. 


Corollary 21.5.11. Let ACS, be a positive Lagrangian plane and let 7, be the 
isotropic plane AS. Set S'=Aj/Ag. Then the image 2’ of A in S¢ is a strictly 
positive Lagrangian plane. 


Proof. That 2’ is Lagrangian follows from the complex analogue of Proposi- 
tion 21.2.13. Since 4’ contains no real elements it follows from Proposition 
21.5.10 that 2’ is strictly positive. 


Alternatively we can write S=S,@S, where S,, S, are symplectically 
orthogonal symplectic subspaces and Ag is Lagrangian in S,. Then / is the 
direct sum of the complexification of Ag in S,¢ and a strictly positive 
Lagrangian plane in S56. 


21.6. The Lagrangian Grassmannian 


Lagrangian manifolds were discussed at some length in Section21.2 in 
preparation for the theory of Lagrangian distributions in Chapter XXV. To 
make that theory global we shall need a careful invariant analysis of the 
infinitesimal case involving Lagrangian planes in symplectic vector spaces. 
This will be provided in the present section. 


Definition 21.6.1. If S is a finite dimensional symplectic vector space with 
symplectic form o, then the Lagrangian Grassmannian A(S) of S is the set of 
all Lagrangian planes in S. 


We shall first show that A(S) has a natural structure as an analytic 
manifold of dimension n(n+1)/2 where 2n=dimS. First we consider the 
subset 

A,(S)= ie A(S); Anpu={0}}, we A(S), 


of Lagrangian planes transversal to a given one. 
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Lemma 21.6.2. A,(S) has a natural structure as an affine space with the 


symmetric tensor product wu as underlying vector space. 
s 


Proof. The bilinear form o(X,Y) with Xep and YeS is zero for all Xeu 
precisely when Yey so it gives an identification S/u— yp’, the dual space of 
hu. If Ae A,(S), then the composed map 


A> S/u 


is an isomorphism, so it has an inverse T which we regard as a map into S 
with range A. It has the properties 


(21.6.1) (X,b>=0(X,Td); Xen, dep’, 
(21.6.2) (Td, TW)=0; db, wep. 


Conversely, every such T: p’-+S defines an element of A,(S). If T is one 
solution of (21.6.1), (21.6.2) and T+R is any other, then by (21.6.1) 


a(X,Ro)=0; Xen, pep’, 
that is, R is a map yp’ > p, and (21.6.2) gives 
o(Tp,RW)+o(RG,TW)=0;  o,wey’, 
since o(R¢, Ry)=0. Thus R corresponds to a bilinear form 
A(o, W)=a(RO, TH)=a(RY, TO); ob, wen’; 


which is symmetric. Conversely, every such A defines a solution R of 
(21.6.1), (21.6.2) as claimed. 


It is useful later to have seen the proof in terms of coordinates also: If 
we choose symplectic coordinates (x, €) so that p is defined by x=0, then 4, 
as a Lagrangian section of T*(IR”), is defined by the graph of the differential 
of a quadratic form A/2 in R’, 


A={(x, Ax); xEIR"} 


where A is the symmetric matrix of the quadratic form. This also identifies 
A,(S) with R""*? since n(n+1)/2 is the number of elements on or above 
the diagonal in an nxn matrix. 

If w, and pw, are two different Lagrangians we can choose symplectic 
coordinates (x, €) such that py, is defined by x=0, and symplectic coordinates 
y=L(x, €), n= M(x, é) such that yp, is defined by y=0. A Lagrangian plane 
AEA, (S)OA,,(S) is defined by 7=Ay and by €=Bx in the two coordinate 
systems. Thus the equations M(x, £)= AL(x, €) can be solved for & and give € 
=Bx, so the coefficients. of B are rational functions with non-zero de- 
nominators of those of A. Hence the local charts we have given make A(S) 
an analytic manifold of dimension n(n+1)/2. (Recall that the Grassmannian 
G(N,k) of k planes in R™ is a manifold of dimension k(N —k), the number 
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of coefficients required to express N —k coordinates in terms of the remain- 
ing k. In particular, G(2n,n)=n? is much larger than dim A(S) except when n 
=1) 

Our coordinate patches leave out a rather small subset of A(S): 


Lemma 21.6.3. If we A(S) then 
(21.6.3) {AE A(S); dim Anp=k} 
is a submanifold of dimension n(n+ 1)/2 —k(k+1)/2. 


Proof. The intersection V=Aqy is an element in the Grassmannian of k 
planes in yp, which has dimension k(n—k). For given V there is a one to one 
correspondence between Lagrangians 4<S with Anu=V and Lagrangians 
xX in S‘'=V?/V transversal to p/V, given by the construction in Proposition 
21.2.13, 2’=A/V. This is easily seen to be a fibration so the dimension of the 
space in (21.6.3) is (n—k)(n—kK+1)/2+k(n—k) as claimed. 


In particular, A(S)~ A,(S) is a null set, the union of manifolds of codi- 
mension k(k+1)/2 for k=1,2,...,n. Any countable intersection () 4,,(S) is 
therefore non-empty which gives a much stronger version of Corollary 
21.2.11. 


In the analytical context in Chapter XXV S will occur as the tangent 
space of a cotangent bundle at some point. Then there is a distinguished 
Lagrangian plane given, the tangent space of the fiber. In what follows we 
shall therefore assume that a distinguished element A,¢A(S) is given. 

For a linear subspace V of IR" the simplest prototype of the conormal 
distributions studied in Section 18.2 is a translation invariant simple layer 
on V. This is a (half density) solution of the equation 


L(x, D)u=0 


for all linear functions L(x, &) vanishing on the conormal bundle V x V° of 
V, where V° is the annihilator of V. In fact, L,(x)u=0 for all L, vanishing 
on V means that u is a simple layer on V, and L,(D)u=0 for all L, 
vanishing on V° means that uw is translation invariant along V. We shall now 
extend this construction to general Lagrangian planes. 

To do so we choose any 4,¢€A,(S), that is, 4, is Lagrangian and 
transversal to the distinguished Lagrangian plane A,. Recall that the bi- 
linear map 

Ag X A, 2(X, Y)r+a(X, Y) 


is then non-degenerate so it defines an isomorphism 4,—>A, which we can 
use to identify half densities in 4, and in 4). Introducing symplectic coor- 
dinates x, € in S adapted to 4, in the sense that x=0 in Ay and €=0 in A,, 


21.6. The Lagrangian Grassmannian 331 


we define for any Ae A(S) 
(21.6.4) I(A,4,)={ueS'(Ao, 2); L(x, D)u=0 for all 
linear forms L(x, €) vanishing on 2}. 


Here Q? is the half density bundle. Note that the definition is independent 
of the choice of basis in A), thus independent of the choice of dual basis in 
A,. We shall prove that the dependence on /, is very mild and that [(A,2,) 
can be used to define the fiber of 4 in a line bundle on A(S). - Note that if 0 
+uel(d,4,) and L(x,D)u=0 for some linear L, then {L,L}u=i[L(x, D), 
L(x, D)]ju=0 for all linear EZ vanishing on 4. Thus L vanishes on 4, so the 
system of equations in (21.6.4) is maximal, and / is determined by u. 

With the symplectic coordinates x, € adapted to A, any other /,€A, (S) 
can be written in the form 

Ay ={(%, );€=Ax} 


where A is a symmetric matrix. Thus the coordinates y=x and n=&—Ax 
are symplectic and adapted to A,. If L(x,é)=0 on A we obtain L(y,y+ Ay) 
=0 on / in the y, 4 coordinates. Now 


L(x, D)u, =0 = L(y, D+Ay)u,=0 
if u,(y)=u,(ye74””/?, The map 
(21.6.5) I(A, 4, )au,, rou, e402 =u, el(A, dy) 


is therefore an isomorphism allowing us to identify I(A,4,) with I(A,A,). It is 
coordinate independent since it can be described as in the proof of Lem- 
ma 21.6.2, and it is transitive. The set I(A) of equivalence classes can be 
thought of as the set of maps 


A,,(8)34, Hu, €1(A,A,) 


satisfying (21.6.5). For every 4,€A,,(S) we have of course an isomorphism 
I(a,A,) > 1(A). 

For every Ae A(S) there is some /,€A,,(S) transversal to 4. It is then 
easy to describe [(A,2,): 


Lemma 21.6.4. Let 2, be transversal to 4 and 4,4, and choose symplectic 
coordinates x, € such that 4, and 2, are defined by x=0 and €=0 respective- 
ly. Every uel{A,,) is then an oscillatory integral 


(21.6.6) u(x) = c(2m)- 304 f eo? 4885912) ge 
where B is the symmetric matrix defining 2 through 
(21.6.7) A={(BE,¢); CER", 


and c is a constant. 


Proof. Since A is transversal to 4, we can define A by an equation x=Bé, 
and B is symmetric since A is Lagrangian. The equations defining I(A,1,) are 
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then 
xju—(BD)u=0; j=1,...,n; 


or after Fourier transformation 
Di+Béua=0. 


The solution is a(€)=c exp(—i<¢Bé, €)/2) for some constant c, which gives 
(21.6.6) with another constant. (We have inserted a factor (22)~3"* in 
(21.6.6) to get agreement with (18.2.11)’ when k=n. A better justification will 
follow shortly.) 


Remark. It is easy to evaluate the integral (21.6.6) using Theorem 7.6.1; if 
«BE, E> =<B'&,&) is a non-singular quadratic form in ¢'=(€,,...,€,) then 


(21.6.6) w= c(2m)™—2BI4 gKB EDI? 5 (x") e~ GE BYI4 | det BY 4, 
In view of (21.6.5) it follows for arbitrary 4 and for 4,€A,,(S) that I(A,4,) is 
a translation invariant simple layer on a subspace multiplied by a Gaussian 


of constant absolute value. However, such representations will be avoided 
since they hide how nicely (21.6.6) depends on B, that is, on A. 


For fixed A,¢A,,(S) and arbitrary 4¢A,,(S) we can identify (A) with € 
by first identifying I(7) with I(A,A,) and then identifying ueI(A,/,) with ce€ 
when (21.6.6) holds. We shall prove that this makes 

I= {(u, A); uel(A)} 


a line bundle over A(S). To do so we must determine the transition function 
which the coefficient c in (21.6.6) is multiplied by when 4, is replaced by 
another Lagrangian plane 4,¢A,,(S) such that Ac A,,(S). First note that it 
foliows from (21.6.6) that the oscillatory integral of u is well defined; we 
have 


(21.6.8) fu(x)dx=(2n)"*e. 


Since u is a half density, c transforms as a —1/2 density in A,, hence c|dé|? 
is a half density in the dual space 4, depending only on the choice of A, and 
not on the choice of x coordinates. Now the projection 


Ty, AAG 


of A on Ay along A, is bijective since 1, 7A= {0}, so the half density c|dé|* 
can be lifted to a half density 


uf =n cldels 


in A. When 2,€A,,(S) the element wel(A,A,) defined by (21.6.6) is according 
to (21.6.5) identified with vel(A, 4.) defined by 


v(y) =C(2m)— 38/4 e—KAMIIID f gil) —<BEED/2) ge, 
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If Ae A,,(S) we can define uz, as we defined Se noting that when v is 
written in the form (21.6.6) then the coefficient c is replaced by 


oe =(22)7"'* {u,(y) dy=c(2n)-" ff ef KY 5) —< BS, 29/2 ~<AY.V?/2) Je dy 
—B I 

t 
det ( re 4 


: : -A |! ; : 
where s is the signature of the matrix ( I a It is non-singular 


ae 
2 


= cenisl4 


> 


because / and A, are transversal, which means that the equations x —B&é=0 
and €-Ax=0 only have the solution x=€=0. The matrix has 2n rows and 
columns, so the signature is an even number, hence e”'*’* is a power of the 
imaginary unit. 

Next we compare the densities xj,|dé| and nj,|dy| on A. In the x,¢ 
coordinate system € parametrizes the point (Bé,é)e/, and in the y, 7 system 
4 parametrizes the same point if n=f—-Ay=€—ABE, so 1,77, E= 
(I —AB)é and 

nt, |dn|=|det (I — AB)| w¥,|d€ 


-B I 
Since |det (J — AB)|=|det ( I A we obtain 

. —-A I 
(21.6.9) uf =e4y*,  s=sgn ( I ee) 


Apart from the powers of i here we have thus been able to identify J(A) with 
the translation invariant half densities Q(A)? on A. 


The integer 


-A 1 
(21.6.10) O igs Ai ay 2s) =4 en ( ; 3) 


has been defined for any four Lagrangian planes Ay, A, A,, 4, such that each 
of the first two is transversal to each of the last two, by means of a 
symplectic coordinate system such that 


(21.6.11) Ag: X=0; AL x=BE; A: €=Ax; A: E=0. 


For reasons of continuity o is constant when dy, A, 42, A, belongs to a 
connected open set in A(S)* where these transversality conditions are valid. 
When uel(A) we have assigned to every Lagrangian A, transversal to A, and 
to A a translation invariant half density uj on A such that, if A, is another 
such Lagrangian, 


(21.6.9) ut jordan dnd ye 

Thus we have modulo 4 

(21.6.12) GU tea = Solas 
(21.6.13) o(Xo, A; Az, A1) = T(Ao, Aj AZ, Az) + O(A0, AS Az, Ar) 
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when all terms are defined. These equalities are in fact valid exactly. It 
suffices to prove (21.6.13) which gives (21.6.12) when 4,=/,. Let Ag, A, a2, 
A, be defined by (21.6.11) and 4, by €=A’x. With the coordinates y= x, 
n=€—Ax adapted to 4, we have the equation 4=(A’—A)y for 4, and 
y=B'y for Aif BE = B’(E — ABO), that is, B’ = B+ B’ AB (the resolvent equation !). 
Thus we must prove that 


o y= e lew I +s . ie 4 

ONG Boe NL. See OR KL eB 

In doing so we may assume that B and B’ are invertible, for the equality is 
stable for small perturbations. Now 


Oo Cr called d(C" oe) 


so the equality becomes 
sgn(—A’+B~‘+)—sgn B=sgn(A — A’+B’~')—sgn B’-! 
+sgn(—A+B-')—sgnB 


which is true since B~=B’~!+ A. This proves (21.6.12), (21.6.13). We also 
have the anti-symmetry 


(21.6.14) G(Ags A App Aq)= —O(A 13423 Ay Ao). 


It follows from the symplectic change of variables y= —&, y=x, which 
makes A, defined by 7=0, A by 7= —Bx, 1, by y= —Ayn and A, by y=0, 


and 
Ale EN 6 s 
sen ( I 3)- sgn tal 


Definition 21.6.5. The complex line bundle M defined on A(S) by the 
covering with the open sets A, (S), 4,¢A,,(S), and the transition functions 


(21.6.15) Pia: = j0(40+ As 2, As). 4€A, (S)OA,,(S), 


is called the Maslov bundle on A(S). 


Thus we have identified J with the tensor product M@? of the Maslov 
bundle and the bundle with fiber at 2 equal to the translation invariant half 
densities in 4. One should think of the elements of M@? as the symbols of 
the elements of I. 


If we apply (21.6.5) with some /, transversal to 1 and 4,, and use 
Lemma 21.6.4 to represent u,,, it follows that every uel(A,4,) can be written 
in the form 

u(x)=(2n)-3"* (el de, xed, 
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where Q(x, €)=<x,€>+(<Ax,x> —<BEé, &>)/2 for some symmetric A and B. 
Thus we have such a representation even if A is not transversal to 1,. We 
shall now study even more general representations of the form 


(21.6.16) u(x)=a(2n)~* 2/4 [iW 9G9  xEd,, 


where Q is a real quadratic form in 2, xJR™ and a transforms as a half 
density if the coordinates x are changed. At first we assume that Q is non- 
degenerate as a phase function in the sense that 0Q/06,,...,0Q/00y have 
linearly independent differentials. Since XG 0Q/06, is identically 0 precisely 
when (0,f) is in the radical of Q, an equivalent condition is that 
-(0, t)e Rad Q implies t=0. Thus 
8 S C(leQ(x, 8)/0x| + |6Q(x, 8)/00|) if Clx|<|4, 

which makes (21.6.16) defined as an oscillatory integral. (The homogeneities 
are not the same as in Section 7.8 but the arguments used there apply. 


Regarded as a phase function Q does not have the homogeneity assumed in 
Section 21.2 either.) We have 


L(x, D) u(x) =a(2n)~ "* 24 [ L(x, 0Q/8x) &2™ dd=0 
if L(x, 0Q/0x)=)\t,6Q/00; for some t,, that is, if L vanishes on the La- 


grangian 
A= {(x,0Q/0x); 0Q/60 =0} 


parametrized by Q. Thus wel(A, A,). 
If AeA, (S) we can define 4 by (21.6.7), and u must be of the form (21.6.6) 
with 
c=(2n)~"4 fu(x)dx =a(2n)—"*™?? [fi dx dd. 


Here @Q is non-singular since 0Q/00=0Q/dx =0 implies (x,0)eA, so x=0 by 
(21.6.7). The integral can thus be evaluated by Theorem 7.6.1 which gives 


(21.6.17) c=aet!4s82 det Q"|-4; 


the power of 27 in (21.6.16) was chosen so that we get cancellation here. 
There is an important geometrical interpretation of (21.6.17). The map 


R"*%3(x, 0) Q5(x, OER” 
is surjective, so the pullback 5(Q%) of the 6 function in IR* is a density de on 
C= {(x, A)eEIR"*®; O6(x, 6) =0}. By (6.1.1) we have 
d-=|D(t, 0Q/08)/D (x, 8)|~ * \de| 
if t;,...,f, are linear functions in IR"*% which restrict to coordinates on C, 
and |dt| is the Lebesgue density on C defined by these coordinates. Since 
Ca(x, ™)H+(x, d0/dx)ed 
is a bijection for any 2, we can always regard d, as a density on 4. When A 
is transversal to 2,, we can take t=0Q/0x, for the restriction to C agrees 
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with €, Thus we have 
di=|det Q"|-* |dé|* 
where both sides are considered as half densities on 4. If we recall that u¥, 
=c|d€é|?, it follows from (21.6.17) that 
(21.6.17) uj, =adZ et!/4sen2, 
If we have another representation 
(21.6.16) fi(x) =G(2n)— +24 | gO. Qh 


of the same element as in (21.6.16), then O must parametrize the same 
Lagrangian as Q, and it follows from (21.6.17) that 


(21.6.17)’ ad3 erisenQ— sen Qvi4 _ G gd. 
The signature difference is not influenced by the x dependence, for we have 
(21.6.18) sgn Q —sgn O =sgn O(0, .)—sgn O(0, .). 


(The signature of a singular quadratic form is also defined by the con- 
vention sgnt=t/|t] when teIRNO and sgn0=0.) To prove (21.6.18) we first 
change the @ coordinates so that Q(0,6) is a non-singular quadratic form 
Q,(@”) for a splitting 6 =(6’, 6”) of the variables. Then we can write 


(21.6.19) Q(x, 8) =Qo(8" + L(x, 8) + Q(x, @) 
where L is a linear map, Q,(0,.)=0 and 


sgn Q=sgnQ,+sgnQ, =sgn Q(0,.)+sgnQ,. 


It is therefore sufficient to prove (21.6.18) when Q and O are linear in @ and 
@, which implies that the right-hand side is 0. Thus assume that 


Q(x, 0) = q(x) + <x, TOY 
where T: RNR" is injective since Q is a non-degenerate phase function. 


We have 
A={(x,q'(x) +7); 'Tx =0} 


where q(x)=<q'(x),x>/2=<&x>/2 if 'Tx=0, (x,QeA. The kernel W of 'T 
and the restriction q|, are therefore determined by 4. Conversely, and 
ay determine 4 since the range of T is the orthogonal space of . If we 
choose x coordinates (x’,x”’) such that W is defined by x’=0 and make a 
linear change of # variables, we have <x, T@> = <x’, @> with no change of the 
signature. Now the signature of q(x’,x”)+<x’,@> is equal to that of the 
restriction to the plane where the (x’, 9) derivatives vanish, that is, x’=0 and 
4+ 0q(0,x")/0x'=0. Thus the signature is equal to that of q|,, which only 
depends on A. This proves (21.6.18). 
From (21.6.17)” and (21.6.18) it follows that 


(21.6.17)"” Gdt = adi e*#oe 20.-)- senQ00../4 
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when (21.6.16) and (21.6.16) are valid. So far we have proved this only when 
A is transversal to A,. Then it follows from (21.6.18) that the Maslov factor 
in the right-hand side does not change if we make small perturbations of Q 
and of @ so that they still parametrize the same Lagrangian. This con- 
clusion remains true even if A is not transversal to 4,, and so does 
(21.6.17)”. In fact, we can always find 4, transversal to A, and to A. 
Replacing A, by 1, means that —<Ax,x>/2 is added to Q and Q which then 
become non-singular. This does not affect the signature difference in 
(21.6.17)” or the definition of d., de on A. 

We can now give an alternative definition of the Maslov bundle as 
follows. Fix 1, and cover A(S) by open sets w such that 4 is defined by a 
non-degenerate Q, depending continuously on A when dew. For another 
continuous function Q,,i¢@, we obtain a locally constant transition func- 
tion 

eileen Q2(0.)—senGa,- 4 De~rg, 


defining the Maslov bundle. (If we always take the same number of fiber 
variables N, say N=n, then the transition function is a power of the 
imaginary unit.) 

Every real quadratic form Q(x, 6) in 2, x R™ defines a Lagrangian plane 


A= {(x, Qs); Qo = 0}. 


R={0; Q5(0, 0) = Q4(0, 0) = 0} 


be the intersection of the radical and the plane x=0. We choose a splitting 
0=(0', 0") of the @ variables such that Ra0t+6” is bijective. This means that 
R has an equation of the form 6’ =w(6"), so 


Q(x, 0’, 0") = O(x, & —W(8"), 0). 


The quadratic form Q(x, 6’,0) is a non-degenerate phase function defining 4, 
so 


Let 


U =a(2n)-* 204 ( iO G9, xEAy, 


is a well defined element in I(A,1,) independent of 6”. We can interpret 
(21.6.16) invariantly as a translation invariant density on R with values in 
I(A, 4). To do so we take a projection T: R‘ > R and consider 


(21.6.16)” Co(R)ayroa(2n)- tt 24 f gies % ¥(T0) dd. 
Since Q(x, 0)= Q(x, 0’ —(8"), 0) we replace 6’ by 6'+W(@") and note that 


Jx(T(O' + H(8"), 8°) d0" =f x(8"), 8°) d 8” 


because T is linear and T(w(8”), 0”) =(W(0"), 0”). Hence the right-hand side of 
(21.6.16)” is equal to Uf y(W(0"),0”)d0", thus an element of I(A,2,) inde- 
pendent of the choice of T. Thus (21.6.16)” interprets (21.6.16) as an element 
of I(A,A,)@ Q(R). 
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If A is an isotropic subspace of S then S,=A’/A is a symplectic vector 
space (Proposition 21.2.1). By Proposition 21.2.13 every Ae A(S) defines 
another Lagrangian 

A= VUNAY(ANA)EAS J). 


In particular 2), gives a distinguished Lagrangian in S,. We shall now show 
that there is a natural map from the fiber of M@Q"? at 2 to the fiber at A, 
of the corresponding line bundle M,@Q? on A(S,), tensored with the 
density bundle Q(An A) and another factor depending only on A. 

In the proof we use symplectic coordinates putting 4 in a convenient 
position. Since A is isotropic we have 


{O}CANA,CA* NAG CAg. 


We choose a basis ¢,,...,¢, for And, and extend it first to a basis ¢,,...,& 4; 
for A°m/A, and then to a basis ¢,,...,¢, for 49. The symplectic form makes 
Af(ANA,) dual to Ap/(A7N4,) so we can extend ¢,,...,&, with e,,)41,...,e, to 
a basis for A so that the commutation relations are fulfilled. Completing to 
a full symplectic basis we then obtain symplectic coordinates (x, é), 
x=(x’,x’",x'”), E=(6,E",6”) such that A is the é’x’” plane and Ap = {(0, €)}. 
We shall write n’ =k, n’ =I, n’’=n—k-—I for the various dimensions. 

Define 4; by €=0 and recall that I(A) is isomorphic to I(A,4,) where 
every element u can be written in the form (21.6.16) with Q nondegenerate 
as a phase function defining 2. Now consider the integral of u with respect 
to x’” restricted to x’=0, 


(21.6.20) ugle = a(2ay As [fee Mae ab: 
Here we have to take into account the radical in the parameter direction 
R={(x'", 6); 0Q/0x" =00/0x'” =6Q/00=0 at (0,0,x'”, #}. 
The bijection {(x, 0); 09/00 =0} a(x, @)>(x, 6Q/dx)eA sends R to 
{(x, eA; xo =x" = El = E" = OJ HMDA. 
The Lagrangian parametrized by Q(0,x”,x’’, 6), where (x’’,6) is now the 
parameter, is given by 
{(x", 0Q(0, x”, x’, Ax"); 0G/8x'” =0Q/00 =0} 
= {(x", €”); (0, x”, x'”, &, E, O)eA for some x”, €}. 
Since A? is defined by x'=é’"=0 and A is the é'x’” plane, the coordinates 
x",¢” induce symplectic coordinates in S$, and the Lagrangian obtained is 
precisely 2,. Thus (21.6.20) defines an element in I(A,, 4, ,)@ Q(A04), hence 
one in [(1,)@ QUA). Passing to “symbols” we have a map from the fiber 
of M@Q? at A to the fiber of M,@Q4 at A, tensored with QUnA), but so 
far it depends on a number of choices made. 


First we show the independence of the choice of Q. If Q is of the form 
(21.6.19) then integration with respect to the @” variables yields the same 
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factor in (21.6.16) as in (21.6.20). Hence Q and Q, define the same map. We 
may therefore assume that Q is linear in 9, 


O(x, 8) = q(x) + (x, TA). 


Here T is injective, Ker’T is uniquely determined, and so is the restriction 
of q to Ker'T: Adding a polynomial vanishing in Ker'T to q changes 
neither (21.6.16) nor (21.6.20), which proves the independence of the choice 
of Q. 

Suppose now that we keep the x variables but replace € by €— Ax where 
A is a symmetric matrix. We require that A is still the é’x’” plane. Then 
A(0,0,x’”) must be in the &’ direction so the quadratic form <A(O,x”,x’”), 
(0, x”, x’”)> is independent of x’”. Passing to the new coordinates means by 
(21.6.5) that u is multiplied by e~'<4**?/?, and this will only lead to multipli- 
cation of u, by e7'<A4(.*0(0.2"0/2 which agrees with the change of 
coordinates induced in S,. 

Finally, if we keep the plane 1, fixed we can only change the x variables 
and the dual € variables so that the plane x’=0 and the plane x’=x’’=0 are 
preserved, for these are the orthogonal spaces of AnA, and of A°nA,. Since 
u transforms as |dx|?=|dx'|*|dx"|? |dx’"|* when we change coordinates, it 
follows that u, transforms as 

|dx'|? |x" [dx'"|-4, 
that is, as 
\de|-F dx" # dx 


Since (¢’,x’”) are coordinates in A, this means that we have actually ob- 
tained a map 
I(A) > 1(A.4)@ QUNA)@ QAA)-#. 
Passing to “symbols” we obtain a map 
M(A)@ Q(A)*? > Mg(A.4) @ Q(A.4)* @ QUNA)@ QA) ?. 


It is clear that we have a bundle map if 4 is allowed to vary only in a 
manifold where ANA has a fixed dimension. Since the transition functions in 
M and M, have absolute value one and those in the density bundles are 
positive, the map is the tensor product of two uniquely defined maps 


M(A)>M(A,) and Q(4)3 > QA)? @ AAA) @ Q(A)-# 


where the second map preserves positivity. It can also be described in a 
much more direct and geometrical way. In fact, since 1, is the quotient 
space of ANA’ by ANA we have 


(21.6.21) 20.4)? = QANA?)? @ QANA)=?. 
The symplectic form makes A/(ANA°) and A/(ANA) dual, so 
(21.6.22) QA)? @ QUA’)? S QAANA)? ® Q(A)=?. 
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Combination of (21.6.21) and (21.6.22) gives 
QA)? = QUAN A)? @ AANA)? @ Q(A)-# 
= QA ,)? @ AANA) @ A(A)-?. 
We shall prove that apart from a power of 27 this is identical to the 
isomorphism defined above by means of (21.6.16), (21.6.20). When doing so 
we observe that the passage from §$ to S, can be made in two steps, first 
from S to S$, where A’=AcdA, and then from S,, to S,; in that case we have 


AyO4,={O}. It is therefore sufficient to examine these two situations 
separately. 


a) Assume that 4</ and choose the coordinates so that A is the &'x’” 
plane as above. Then x’=€”’=0 in J. Changing the &” variables if necessary 
we may assume that A, is of the form x”=B¢". This means that 1, is 
parametrized by <x”, €") —<BEé", &’5/2 and that / is parametrized by 


Q(x, CY =X, SY +", OY — CBE", 0" /2. 
Thus (21.6.16) and (21.6.20) become apart from the volume factor |d é'||dx’"| 
u(x)=a(2n)-@* anit 2ni4 {| eiQts, 5) qd &’ dé", 
U1 g(”) = a (2) HH BHT LGC E> — CBE ED/2) gr 
with the coordinates &, x’” in A=ANA. Since 6Q/0(€, €") =(x', x” —Bé”) the 
set C={(x, &, €), 0Q0/0(€, €’")=0} is parametrized by &, &’, x’” and d. 
=|dé'||dt"||dx’"|. The positive half density on A associated with u is then by 


(21.6.17) 
lal |d "|? do" |dx'"|*. 


That associated with u, on 4, is |a|(2x)~ CO" *”"/* |dé"|?, which corresponds 
with the coordinates used to 


ja] (22) Or PF id E43 |d E\ fd x'"| [dF -? |d x" * €Q(A,)* @ QUTA)@ QA) ~?*. 
This differs by the factor 

(2n)7 Gm M4 — (2 q)dim 4 — 2 dim 4m do —2dim An ayia 
from the identification of Q(A)? with Q(,)? @Q(A)* given by (21.6.21) and 
(21.6.22). 


b) If An A={0} we can choose the coordinates so that A is the €'x’” 
plane and J is transversal to the plane £=0. To do so we observe again that 


{0} CANA EADY CAp.- 
Choose a basis 6,,...,6, for AMA, and extend it to a basis &,...,&, 
Cpstete++s@, for A, where 2!=dimS,. In S, we can find e@,,,...,e¢,, 
spanning a plane transversal both to Ay, and to A,. Let ef,,,...,e¢,, be a 
biorthogonal basis for 4 5,, and choose e,,,,...,@,,,€4% and 


fxg trees €ea1EA47AAg in the classes of e@,,,...,ef,, Then &,...,84), 
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€,41,-.-,@, form a partial symplectic basis in S, and ¢),...,& 4) span A°MAo. 
If uz is the isotropic plane spanned by e4;,...,é, then pnAcAnd= {0} 
and podg={0} since uNAp CANA and &1,...,€, Cx+1,---2€n are linearly 
independent. To choose ¢,...,e, we observe that ¢,,...,¢, are in uw’ and 
linearly independent modulo yu. Thus we can find ef,...,e¢eS, biorthogonal 
to the classes of ¢,,...,¢, and spanning a Lagrangian plane in S, transversal 
to 4, and to A,. Now we pick e,,...,e,€u" in the classes of e{,...,e¢ and 
orthogonal to & 4,...,8&4), Which is possible since e,,,,...,¢,,,€u. This 
extends the partial symplectic basis, and the plane spanned by ¢,,...,¢, is 
transversal to A and to A, since the intersections are contained in yp. Because 
the Lagrangian plane spanned by e,,...,e, is transversal to 4) the remaining 
elements &,,),1,--->& in a symplectic basis can be chosen uniquely in Jp. 
With the coordinates chosen 4 is of the form x =Bé and (21.6.6) becomes 


u(x)=c(2n)~3"/4 f ei &) —<BE,ED/2) qe 

We have 
u g(x”) = (2m) ~ 34 [f ellCx E+E) — CBE EID dye" GE, 
The phase function Q(x”, x’”, )=<x", EO +. x", E'" — (BE, &/2 is non-de- 
generate, for 
C= {(x", x'”, C); 6Q/0x"” =0, GQ/0E =0} 

= {(Be, ¢); 6" =0, (BC) =0} =10 A? 

has dimension n”. We can take €” as coordinates there, hence 
de=|dé"| |D(6Q/8x", 6Q/dx'", 0Q/d6)/D(x", x’", E\-" 
=|d¢"| |det 27 Q/ag’a6|-*. 

The absolute value of the half density in 1, associated with u, is 

(2nyr"—" 4 |dE"F |det 07 Q/dE'0E|-# 
for —3n/4+(n"+2n4+2n'\/4=(n'" —n')/4. With u we have associated the 
half density 

cdl? =cldg'|*|dé"|* ae"? 
on A parametrized by € as {(Bé, é)}. If we take x’, é", €’”” as parameters 
instead, noting that |Dx’/Dé| =|det 070/dé'dé’| then 
eld él? =c\det 07 Q/AE Ge"\-* |dx'* |de"8 de". 

These variables are useful since A’ is defined by x’=¢’"=0; the quotient 
AANA’) is thus parametrized by x’ and €” which in (21.6.22) are paired 
with é and x’” by means of the symplectic form, identifying |dx’|?|dé’"|? 


with |dé’|~*\dx’"|-*. According to (21.6.21), (21.6.22) one thus identifies 
c|dé|? with 


cldet 6°Q/8' dE'|-#|dg'|-F|dx'"|-¥\de" 
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where &” is the parameter on AN A°x/,. Apart from the factor 
(2 nr" —n/4 (2 n)sim A-—2 dim AndAo)/4 


this agrees with the density obtained from u,. Summing up, we have 
proved: 


Theorem 21.6.6. Let A be an isotropic subspace of the symplectic vector space 
S and let S,=A°/A with the induced symplectic structure. If A¢A(S) then i, 
=(AN AVA Ae A(S,). There is a natural map 


M (A) @ Q(A)} +M(A4)@ QA)? @ QA A) @ Q(A)-* 


which in symplectic coordinates x’,x",x'", &',é",é'" such that A is the x’ &” 
plane maps the symbol of (21.6.16) to that of (21.6.20) multiplied by 
\dé|~* |dx'’|-*. It is a bundle map when the dimension of ANA is constant, 
and it induces maps 


M(4)>M(A,), _ Q(A)F + Q(Ay)*@ Q(A)-4 @ QUAN A). 


The latter is equal to the isomorphism defined by (21.6.21), (21.6.22) multiplied 
by 
(21.6.23) (277) dim 4— 2 dim 4ndo~ 2 dimdnay/4_ 


Our main application of Theorem 21.6.6 concerns the composition of 
linear canonical relations defined in Theorem 21.2.14. 


Theorem 21.6.7. Let S; be a symplectic vector space with symplectic form o,, 
j=1,2, 3, and distinguished Lagrangians Ajo, and let Gy cS, x82, GycS,xS3 
be linear canonical relations from S, to S, and S; to S, respectively, that is, 
Lagrangians for o,—o, and for o,—0;. Set G=G,°G,cS,xS,; and N 
={yeS,; (0, yeG,, (y, 0G}. Then (21.6.16), (21.6.20) give rise to natural 
bilinear maps 


M(G,)xM(G,)>M(G), — Q(G,)? x Q(G,)* + A/G)? ® Q(N). 


The latter map is (2n)~°'? times the map derived from (21.6.21), (21.6.22) where 
e=dimN is the excess of the (clean) composition, and Q(A(S,))? has been 
trivialized by means of the symplectic form in S,. 


Proof. This is the special case of Theorem 21.6.6 where A is the diagonal in 
S, times 0 in S, and in S,. We have identified Q(4) with C by means of the 
natural density defined by o'3?/n,! in S,2A. Note that (21.6.23) simplifies 
because dim 4 =2dim 4A). 


The map of densities in Theorem 21.6.7 simplifies a great deal if one of 
the relations, say G,, is the graph of a linear canonical transformation. 
Indeed, with A=G,xG, and A=A(S,) we have AN A={0} so (21.6.21) is 
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just the identification of 2, with AN A°=G,. (21.6.22) expresses the duality 
between A and 4/(AM A’) given by the symplectic form. Now 


A=(G, x {0})OUn4’) 


because (7,,72,72,73)€4 implies (G, 3, 73,72, 73)EA A’. The duality is 
therefore the self duality of S, when 4 and G, are both identified with S,. 
Thus the identification of Q(A)? with € means that we identify Q(G,)?=C 
using the natural density defined by o, in S, (or o, in S,), lifted to G,. 
Multiplication of d,¢Q?(G,) and d,€Q?(G,) therefore means that d, is 
identified with a constant c; and that one forms cd) € (G2)? = QG)}. 

As in the discussion preceding Definition 21.2.15 one can consider the 
general representation (21.6.16) of elements in M(A)@ Q(A)? as a special case 
of the map in Theorem 21.6.6. However, this argument would now be 
circular since (21.6.16) was used when we defined the map in Theorem 
21.6.6. 


We have insisted on invariance throughout this section in order to have 
an extension to symplectic vector bundles immediately available. Thus as- 
sume that Y is a manifold and E> Ya symplectic vector bundle with fiber 
dimension 2n over Y. This means that every point in Y is assumed to have a 
neighborhood U where there is a vector bundle isomorphism 


UxT*(R")> Ely 


respecting the symplectic structure. It is clear that the set A(E) of La- 
grangian planes in the fibers of E forms another fiber bundle on Y with fiber 
A(T*(R")). Assume now that we have a distinguished section A, of A(E). 
Then the Maslov line bundles M defined on the fibers A,(E) define a line 
bundle M(E)— A(E). In fact, we can trivialize E locally so that 4, corre- 
sponds to the fiber of T*(IR") at 0, and this gives a local trivialization of 
M(E). Different trivializations are obviously compatible. If now / is any 
other section of A(E) we can pull M(E) back to a line bundle 1* M(E) on Y 
which will also be called the Maslov bundie. 

The typical situation we have in mind is a Lagrangian submanifold Y 
<T*(X) where X is a C® manifold. Then the tangent spaces of T*(X) 
define a symplectic vector bundle E on Y and the tangents of the fibers in 
T*(X) give a distinguished Lagrangian section. We also have the La- 
grangian section of E given by the tangent planes of Y. This defines on Y a 
well defined Maslov line bundle, with structure group Z, of course. It is 
needed for the invariant (and global) statements in Chapter XXV. 


So far in this section we have emphasized the analytical origin of the 
Maslov bundle. To clarify its meaning we shall now supplement this with a 
brief purely geometrical discussion although this will not be required in the 
further developments here. In doing so we shall restrict ourselves to a 
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symplectic vector space S again. We can take S=T*(R") with the usual 
symplectic coordinates x, ¢. 

i) Let n=1. Every line in T*(R) is Lagrangian so A(S) is the projective 
line P,R). A generator for z,(A(S)) is therefore obtained by turning the x 
axis in the positive direction an angle of x. Let U,, j=1,2, be the set of lines 
transversal to 4, where /, is defined by €=0 and A, is defined by €=x. It is 
clear that U, and U, cover A(S). When AcEU, 0 U, we can define 4 by x =Bé 
where B+1. The integer (21.6.10) 

o(to,2shasary=dsen(~)  _ 5) 

is half the signature of the quadratic form —x?+2xé—Bé€?, that is, —1 if 
B>1 and 0 if B<1. Thus the Maslov 1 cocycle is defined by the cocycle 
c,,(B)= —1 if B>1, c,,(B)=0 if B<1. We can now compute the change of 
an element in the Maslov bundle as we go around P(R) once, that is, 
follow the generator for 2,(P,(IR)) described above. If we have the value c in 
the trivialization corresponding to U, then the value in U, on the line 4 
defined by x= Bé€ is c/i if B is large positive and c when B is large negative; 
when B goes from +o to —oo the element in the Maslov bundle will 
therefore be multiplied by i. We could make the same calculation with the 
cohomology class in H+(A(S)) defined by o and conclude that its intersec- 
tion with the generator of z,(A(S)) is equal to 1. 

ii) When n>1 we shall prove that every closed C’ curve in A(S) is 
homotopic to a sum of closed curves of the form 1,(4)x4, where 4,(t) 
< A(T*(R)) is the path described in i) and /, is a fixed Lagrangian plane in 
T*(R"~'). The choice of 2, is unimportant from the point of view of 
homotopy, for it follows from Lemma 21.6.2 that A(T*(IR"~4)) is connected. 
Let troA(t), A(t+1)=A(t), be the given C1 curve in A(S). We shall first prove 
that it is homotopic to a curve intersecting A) in a very simple way. For any 
to we can choose ¢>0 and suitable symplectic coordinates such that 4, is 
still defined by x =0 and 


AD ={(AOE,O}, |t—-tol Se, 


where A is a symmetric matrix which is a C' function of t. Let us examine 
the intersection with a Lagrangian plane x =Bé where B is symmetric. The 
intersections are given by (B—A(t))¢=0, so we must ask if B=A(t)+B, 
where B, is of rank k<n. The dimension of the manifold M, of all sym- 
metric B, of rank k was essentially determined in Lemma 21.6.3; to choose 
B, one must first choose a subspace of codimension k and then a non- 
degenerate quadratic form in the orthogonal space, so the dimension is 


k(n—k) +k(k+1)/2=n(n41)/2—(n—k) (n—k +1)/2. 


From the Morse-Sard theorem it follows that the range of the map 
[tp —9, tg + 6] x M, a(t, B,)r> A() 4B, is of measure 0 if k<n—1, and so is the 
set of critical values when k=n—1. If B is not in these sets and det (B— A(t)) 
=0 for some t with |t—t,|<6 it follows that the rank of B—A(t) is n—1 
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and that A’(t) is not a tangent to M,_, at B—A(t), that is, the derivative of 
det (B — A(t)) with respect to t is not equal to 0. Now the maps 


(x, ¢)ro(x—eBE, 6), OSeSI, 


are symplectic. If we use them to deform A(t) we obtain the desired simple 
intersection with 1) when |t—t)|<6, and choosing B small enough we do 
not disturb such a property already achieved in another closed set. After a 
finite number of such deformations we find that every homotopy class 
contains a curve such that 

a) A(t) A, ={0} except for finitely many values t,,...,ty of tmod1, and 
in a neighborhood of each of these points A(t) can be written in the form 


x=A(té. 


b) If A(t) is singular then A(t) has rank n—1 and det A(t) has a simple 
zero. 

Indeed, in order to obtain the second property in a) we just have to 
choose as the plane €=0 a Lagrangian plane transversal to 4, and a finite 
number of other Lagrangian planes. 

Except in a neighborhood of t,,..., fy modi we can represent A(t) in the 
form ¢=B(t)x. Choose y,eC® with OSy,S1, periodic with period 1, dis- 
joint supports and y;=1 in a neighboorhood of t;. Set y=2y,. Then the 
curves A® defined by 


A(t)= {x (ex()+(1—2)) BO) x)}, OSeS], 


where y+1, and 2*(t)=A(t) where y(t)=1, give a homotopy which connects 
the curve trs/(t) to a finite number of loops starting and ending at A, 
= {(x,0)}, each containing only one Lagrangian not transversal to do. It 
remains to examine a curve with only one such point. 

Assume as in b) that det A(t) has a simple zero when t=O, and let 
A(0)€,=0. We can choose coordinates so that € is the first unit vector, 
that is, 

0 
0 


where A, is a symmetric (n—1)x(n—1) matrix. The derivative of det A is 
Ai, (0) det A, so A{,(0)+0. The sign has a symplectically invariant in- 
terpretation. In fact, if E(t)eC' and €(0)=(1, ..., 0) then 


(0, £(0), A(t) S(O), €() = <A (9) E(0), €(0)> 


has the derivative A’, , (0) at 0. Hence 


0 
A(0)= ( Ph det Ay +0, 
0 


d 
(21.6.24) sgn Aj ,(0) = sgn WOOO) ¥(t))|t=0 


if t ++ y(t) is a curve with y(t) € A(t) and 0 # (0) € Ao also. 
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We may make a further homotopy of A near 0 so that in a neighborhood we 


obtain : 
_ [AO 0 
A(t) = ( 0 ) ; 


The deformation above can then be made in the variables £),...,€, for all t and 
gives a homotopy to A(t) defined by 


AE) = {%, €1, 0); 1 AY 0) = x) 1} 


when |t| < 1/2 and periodically continued to IR. This is the example in i), 
perhaps with a change of sign. We have therefore proved: 


Theorem 21.6.8. The first homotopy group 1,(A(S)) is isomorphic to Z. Every 
homotopy class contains a curve t-+A(t) such that A(t)nAg = {0} except for a 
finite number of values of t where the intersection is simple and has a well 
defined sign given by (21.6.24). The sum of the signs gives the isomorphism 
with Z. The intersection of the corresponding homology class with the Maslov 
cohomology class defined by (21.6.10) is equal to this integer v; going around 
the curve multiplies the elements in the Maslov line bundle by i’. 


Notes 


Most of Section 21.1 comes from classical mechanics and can be found in 
another form in, for example, Carathéodory [1]. The theory of Fourier 
integral operators created new interest in these matters. A special case of 
Theorem 21.1.9 for example can be found in Duistermaat-Hérmander [1], 
and a more general version was given by Melrose [2]. There is a great deal 
in the literature about degenerate symplectic structures, but we have only 
included what is directly related to the equivalence theorem of Melrose [2]. 
Also Section 21.2 begins with classical results. The importance of allowing 
clean intersections was stressed by Duistermaat-Guillemin [i] and Wein- 
stein [3]. The discussion of clean phase functions is taken from these papers. 
The non-degenerate case was discussed in Hérmander [26] and goes back 
to classical results on generating functions of canonical transformations. 

The normal form in Theorem 21.3.2 was established by Duistermaat- 
Hormander [1]. That in Theorem 21.3.3 is due to Sato-Kawai-Kashiwara 
[1] in the analytic case and Duistermaat-Sjéstrand [1] in the C® case. For 
the extension in Theorem 21.3.5 see also Yamamoto [1]. Theorem 21.3.6 is 
a key result in Nirenberg-Tréves [2]. 

The equivalence of glancing hypersurfaces was conjectured by Sato in 
the analytic case but Oshima [1] gave a counterexample. In the C™ case the 
result was proved true by Melrose [2], and Section 21.4 mainly follows his 
paper. The results on canonical relations with folds are due to Melrose and 
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Taylor (see Taylor [3]); they are also important in the study of mixed 
problems. 

The symplectic reduction of quadratic forms is an old topic in me- 
chanics, for it occurs in the study of Hamiltonian systems at an equilibrium 
point. The presentation here depends on Melin [1], Boutet de Monvel [4], 
Sjéstrand [3], Ivrii-Petkov [1] and Hérmander [36]. 

The Maslov index studied in Section 21.6 was defined by Maslov [1] in 
connection with his canonical operators. It expresses the classical obser- 
vation in geometrical optics that a phase shift of 2/2 takes place at a caustic 
(see Section 12.2), and it was formulated quite explicitly already by Keller 
[1]. Arnold [1] gave a clear presentation of the Maslov index which was 
rephrased somewhat in Hérmander [26]. The presentation here is modified 
in order to fit the case of Lagrangians with clean intersections. For the case 
of complex Lagrangians we refer to Melin-Sjéstrand [1] and Wang-Tsui 


[1]. 


Chapter XXII. Some Classes of 
(Micro-)Hypoelliptic Operators 


Summary 


In Chapter XI we have proved that a differential operator P(D) with 
constant coefficients is hypoelliptic if and only if for some p>0 


(22.1) P(£)/P(2)=O(\E|-"'") for every « when €—>0o in R". 
Recall that P(D) is called hypoelliptic if 
(22.2) sing suppu=sing supp P(D)u, ue". 


For operators with constant coefficients (22.2) is equivalent to microhypoel- 
lipticity, 


(22.3) WF (u) = WF (P(D)u), ueQ’, 


Microhypoellipticity implies hypoellipticity but the converse is not always 
true. We shall mainly consider microhypoellipticity here. 

In Chapter XIII we extended the results just mentioned to operators of 
constant strength. These are differential operators P(x, D) such that 


(22.1)! |DEDEP(x, OS CyplP(x, S| |e"! for large (1. 


However, the perturbation methods based on pseudo-differential operators 
are much more powerful than those in Chapter XIII. This will be seen in 
Section 22.1 where we improve the results of Section 13.4 by studying 
pseudo-differential operators (of type p, 6) with a pseudo-differential para- 
matrix. These results still have serious flaws though. One is that pseudo- 
differential operators of type p, 6 are not always invariant under changes of 
variables so the classes of hypoelliptic operators obtained are not in- 
variantly defined either. A striking example is given how the hypoellipticity 
of the heat equation may be masked by a change of variables from the 
point of view of the criteria in Section 22.1. Another flaw is that the results 
proved in Section 22.1 do not cover some simple types of hypoelliptic 
operators occurring in probability theory such as the Kolmogorov equation 
(7.6.13) for which we have already given an elementary proof of hypoellip- 
ticity. 
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Section 22.2 is therefore devoted to a study of “generalized Kolmogorov 
operators” of the form : 
VXF7 +X 


where X;, are real vector fields. It is proved that such operators are hy- 
poelliptic if the Lie algebra generated by all X; gives a basis for all vector 
fields at any point. The methods used in the proof can also be applied to 
pseudo-differential operators with non-negative principal symbol p,, under 
similar symplectically invariant conditions on-p,, and on the subprincipal 
symbol p;,_,. It is shown that if PucH,, at some point then ueH,,, m_r+0 
there for an ee(0, 1]; it is customary to say that one has hypoellipticity with 
loss of 2—e derivatives then, since this is the deficit compared to the elliptic 
case. In the proof we use the Fefferman-Phong lower bound of pseudo- 
differential operators proved in Section 18.6 but otherwise we shall only rely 
on facts concerning pseudo-differential operators contained in Section 18.1 
until we reach Section 22.4. There the general theory of Sections 18.5 and 
18.6 will be used freely. 

The lower bound of pseudo-differential operators due to Fefferman and 
Phong is very precise in some respects and very weak in others. In Section 
22.3 we therefore prove another lower bound due to Melin which gives 
necessary and sufficient conditions for a first order operator to majorize a 
positive constant operator. This estimate improves the theorems on hy- 
poellipticity in Section 22.2 for the case of loss of exactly one derivative. 
Complete results on such operators with principal symbol taking values in a 
proper convex angle in € are proved in Section 22.4. As already mentioned, 
the proofs there are less elementary and Section 22.4 will not be referred to 
in later chapters. However, the methods are interesting and will reappear 
later in Chapters XXVI and XXVII in related contexts. 
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Let X be an open set in R” and P=(Py)j.21,..,.y a NXN matrix of 
properly supported pseudo-differential operators P,<'P™, for some p, 6 with 
0<6<pS1. (See the end of Section 18.1.) Under hypotheses similar to (22.1) 
we want to prove that P is hypoelliptic, that is, that solutions u of the 
system 


are smooth where f is smooth. To state the hypotheses on the symbol 
p(x, )eS%5 of P we introduce a notion of temperate norm on €%, parame- 
trized by T*(X): A norm || ||,,- on C% defined for every (x, )eT*(X) will 
be called temperate if for every compact set KX there are constants C 
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and M such that with ||z{|=()"|z,|’)* 

(22.1.1) C7 M+) ™ Iz Size S CUED Iz; xe K, GER", ze. 
Definition 22.1.1. A symbol p(x, €)eS*,(X x IR") with values in L(C*, €*) is 
said to be hypoelliptic if there exist two temperate €* norms || ||... and 


| |x.~ parametrized by T*(X) such that for every compact set Kc X there 
are constants C, C,, with 


(22. 1.2) IZl,eSCllp(x, zie; xeK, \el>C, zeC*; 
(22.1.3) I(x, ) 2 eS Cap(h +E) Plt |zi es | xeK. 
It would not really have been necessary to assume peS®, for this is a 


consequence of (22.1.3). To clarify the meaning of these conditions we 
assume first that N=1. Then |jz/|., -= M(x, €) |/z||,- and the conditions are 


M(x, )SC\|p(x, OI,  PB(x, HS Cyg M(x, YL + [EI lar e 
so M(x, €) must be equivalent to |p(x, €)| for large |é|, and 
p(x, NS Cplp(x, HL +e)", = xe K, |€[>C. 


This weakened form of (22.1) together with polynomial bounds for p and 
1/p for large |é| is equivalent to the hypoellipticity i in the definition. 
If we choose 


Iz eH +leylizi, tz =A +1e iizi, 


the condition (22.1.3) is that peS'”; and (22.1.2) means that 1/peS¢-; for 
large |¢|. More generally, we could split the z variables into several groups 
for which we use different exponents s and t. This grading leads to the 
Agmon-Douglis-Nirenberg elliptic systems where the notion of principal 
symbol depends on the location in the matrix symbol. (See Section 19.5.) - 
Here we have tacitly assumed the following simple but important lemma. 


Lemma 22.1.2. If the hypotheses of Definition 22.1.1 are fulfilled, then 


(22.1.4) |DgDE p(x, 6)-' zi, -SCap(d +16) lel iziiy gs «= xeK, [El>C. 


Proof. (22.1.4) concides with (22.1.2) when «=f=0 and will be proved in 
general by induction for increasing |x+ |. Differentiation of the equation 
p(x, €) p(x, 2)" =1 
gives when |é|>C 
p(x, €) Df D8 p(x, €) — Jey g DE DE p(x, €) DE DE" p(x, 2)! 
where o'+a”=a, B’+B"=B and |x’+P"\|<|a+ | in the sum. By the in- 
ductive hypothesis and (22.1.3) we obtain 
llp(x, €) DEDE p(x, €)-? 2% eS Cop(L +E tl zie 
so (22.1.4) follows from (22.1.2). 
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We can now prove an extension of Theorem 18.1.9. 


Theorem 22.1.3. Assume that Pe¥,°,(X;€%,C) is properly supported and 
that the symbol satisfies the hypotheses in Definition 22.1.1 with O<é6<pSl. 
Then one can find Q satisfying the same hypotheses with the norms in 
Definition 22.1.1 interchanged and 


PQ=I+R,, QP=I+R, 
where R, and R, are of order — ov. 


Proof. As in the proof of Theorem 18.1.9 it is sufficient to construct Q,,Q, 
with PQ,=I+R,, Q,P=I1+R, and R,, R, of order —oo, for Q,—Q, is 
then automatically of order —oo. By Lemma 22.1.2 we can choose Q, 
properly supported with symbol p(x, €)~' for large |Z]. The symbol of 


Ry=PQ,-1 
has the asymptotic expansion 


ro(x, C)~ 2 Dope, 6) DE p(x, 6) */a! 


so using (22.1.3), (22.1.4) we obtain 
Ney oe, €) ZI eS Cap (1 + [EPP Plt OF iz 


The symbol of R‘ satisfies the same condition with 6—p replaced by 
k(6—p) in the exponent. Since || ||. is a temperate norm it follows that 
RoevT*e-” for some fixed M, at least on a compact set, so there is an 
operator T with 
T~ ¥ (—Roie PMS NOP) 
J<N 

for every N. Thus PQ, T=I+R, with R, of order —oo. The symbol of Q, 
=Q,T satisfies estimates of the form (22.1.4). Similarly we construct Q, 
noting that Q, P=I+Ro where Rj has the same properties as Ry with || ||” 
replaced by || ||’. This completes the proof. 


The construction of Q, could also have been obtained from the con- 
struction of Q, by passage to the adjoint. In fact, if P satisfies the hy- 
potheses of Theorem 22.1.3 then P* also satisfies them with respect to the 
duals of the norms || ||\f. and || ||... We leave the simple verification for 
the reader. Instead we give the main application of Theorem 22.1.3. 


Theorem 22.1.4. Assume that Pe ¥,°5(X :C% ,C%) is properly supported and 
that the symbol is hypoelliptic in the sense of Definition 22.1.1. Then P is 
microhypoelliptic, that is, 


(22.1.5) WF (u)=WF(Pu), ueQD'(X, C%). 
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Proof. The inciusion 
WF (Pu)c WF (u) 


is tne global p, 6 version of (18.1.25), which also gives 
WF(QPu)< WF(Pu) 


if Q is the parametrix in Theorem 22.1.3. Since QPu—ueC®, this proves 
(22.3.5). 


The result in Theorem 22.1:4 can of course be micro-localized further; if 
(22.1.2), (22:1.3) are only satisfied in an open cone '< T*(X)~0 then (22.1.5) 
is valid in . On the other hand, when 1—pSé6<pX1 the results in this 
section can be given a global form where X is a manifold and 
PeW™,(X ; E,F) for two vector bundles E, F on X with equal fiber dimen- 
sion. The norms || ||‘. and || ||’. are then interpreted as hermitian norms 
on the bundles FE and F lifted to T*(X). One just has to show that the 
conditions in Theorem 22.1.1 are invariant under a change of variables. We 
leave for the reader to verify that this follows from the p,6 version of 
Theorem 18.1.17. Note that such invariant classes of hypoelliptic operators 
are only obtained when p>4. The heat operator 07/0x7—4/@x, satisfies 
(22.1) with p=4. If we introduce 


Yr=X1, V2 =Xq4x4/2 

then 
8°/6xt —6/0x2 =(6/8y, + y1 0/8 ya)’ —G/0y2 

=O? Oy} + yi 07 /0y3+2y1 O7/8y, Oyr. 
The symbol —(7,+ y,1,)* does not satisfy the conditions in Definition 
22.1.1. This shows that the restriction p>4 is essential and also that just 
comparing the symbol and its derivatives as we have done here is not 
always a satisfactory method for determining if an operator is hypoelliptic. 

On the other hand, Theorem 22.1.4 proves the hypoellipticity of many 


operators which are very far from the operators of constant strength dis- 
cussed in Chapter XIII. An example is the symbol 


P(x, d)=c + |x]? 16)" 


where c>0. The hypotheses of Theorem 22.1.4 are fulfilled with p=1 and 6 
=p/v if u<v. Since 


«|x|? DI? 6, 6> = — |x|? $(0) = —2n $0) 


we see that 2n+jx|?|D|? is not hypoelliptic, so the condition u<v cannot be 
omitted. 
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22.2. Generalized Kolmogorov Equations 


As an example of Fourier transforms of Gaussian functions we constructed 
in Section 7.6 a fundamental solution of the Kolmogorov equation 


67 u/dx? +x du/dy—du/dt=f, 


which represents Brownian motion on R,x denoting the velocity and y the 
position of a particle. The fundamental solution is smooth outside the 
diagonal and one can easily deduce that the Kolmogorov equation is 
hypoelliptic. It is clear that it does not satisfy the conditions in Definition 
22.1.1 for any choice of variables. 

In this section we shall study more general equations of the form 


(22.2.1) (x B+ Lo +c) way 


where L; are real C” vector fields in the n dimensional manifold X and 
ceC”(X). It is easy to see that (22.2.1) cannot be hypoelliptic if c=0 and the 
rank of the Lie algebra generated by Lo,...,L, is <n in an open set Y, that 
is, the vector fields 


(22.2.2) EAT Webs gD se: 


have rank <n at every point in Y In fact, in a neighborhood of a point 
where the rank k is maximal we can by the Frobenius theorem (Theorem 
C.1.1 in the appendix) choose local coordinates such that 


k 
Lj;=), aj0/0x,, JH..." 
1 


All functions of x, are then solutions of the homogeneous equation (22.2.1). 
On the other hand, we shall prove that the equation is microhypoelliptic if 
the rank is n everywhere: 


Theorem 22.2.1. Let L;, j=0,...,7, be real C® vector fields in the manifold X 
generating a Lie algebra of rank dim X at every point. Then the equation 
(22.2.1) is microhypoelliptic. 


Example. The operator 47/0x3+)\ x7 6/@x; is microhypoelliptic if OSm, 
<m3<...<mp are integers. 2 
The statement is local so we may assume in the proof that X <IR”. Set 


P= daly Lore 


If L(x, ¢) is the symbol of L,/i, which is a real linear form in ¢. then the 
principal symbol of P is 


PAx, =) L(x, é)*. 
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We can write ‘ 
Be T+ce 


J 


where [*= —L,—c;, is the adjoint of L; and T=—L +) c,L;. Then T and 
L,,...,L, also generate a Lie algebra of rank n. Denote the I? norm by || |. 


Lemma 22.2.2. If K is a compact subset of X then 


(22.2.3) ¥ ||L,ull?<Re(Pu,u)+ Cx |lull2, ue C%(K), 
1 


(22.2.4) by \PP(x, D) ulloy a 2 Paw (% D)u'| a 1) 
1 1 


SC, Re(Pu,u)+ Cx lull?, we C3(K). 
Proof. (22.2.3) follows from the identity 


Re(Pu, u) => |L; ull? +((T+ T*)/2 +c) u, u) 
since T+ T* is just multiplication by a function. Now 
py =2 Y OL JOE, Lj, — Pay =? Y eL Ox, L; 
so p(x, D) is a linear combination of L(x, D) with C® coefficients while 


Pry) (X, D)—2 » Lil, D) Lx, D) 
is of order 1. This proves (22.2.4). 


We shall prove Theorem 22.2.1 using only the estimates in Lemma 22.2.2 
and that p, is real valued. This will allow us to generalize the theorem at 
the end of the section without repeating the proof. Thus we now denote by 
P=p(x, D) a properly supported pseudo-differential operator of order 2 with 
real principal symbol p, such that (22.2.4) is fulfilled. We denote by Q, the 
set of all properly supported first order operators q(x, D) with real principal 
symbol such that for every K 


(22.2.3) |q(x, D) ul]? SCx Re(Pu,u)+ Cxljull?, we C(K). 


It is clear that Q, contains all operators of order 0 so this is just a 
condition on the principal symbol. Let E, be a properly supported self 
adjoint pseudo-differential operator with symbol (1+|é|?)”?. Then the as- 
sumption (22.2.4) means that P’=p)(x,D) and P,=E_, pa(x, D) are in Q;. 
Let Q, be the set consisting of (P—P*)/i and all commutators [q, q’]/i with 
4,q€Q,, and define Q, successively for k>2 as the set of all [g,q’]/i with 
qeQ,_, and q’eQ, or geQ,_, and qg'eQ,. In view of the Jacobi identity 


(a. L@’,@"JJ=(L¢.4').9"1+1[l¢",¢1,4'] 
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we just have to use the element (P—P*)/i from Q, here. The main step in 
the proof of Theorem 22.2.1 is now the following 

Lemma 22.2.3. If q,€Q, and e<2'—* we have for every KEX 

(22.2.5) Hd Ule-1y S C(I Pull + lull), weCe (kK). 

Proof. (22.2.5) follows from (22.2.3) when k=1,. In the proof for k>1 it is 


convenient to have Re(Pu,u)20 which we can always achieve by adding a 
large constant to P, since we work in a compact set. Write 


P'=(P+P*)/2,  P” =(P—P*)/2i 
for the real and imaginary parts of P. Thus P’ is assumed to be positive. To 
prove (22.2.5) for k=2 and q,=P” we have to estimate 
||E_, P’ ull? =(P" u, Au)=((P—P’)u, Au)/i 
where A =E? , P” is of order 0. It is clear that 
(Pu, Au)| SC || Pull lull, 
and Cauchy-Schwarz’ inequality gives in view of the positivity of P’ 
\(P’ u, Au)| <(P’ u, u)?(P’ Au, Au)? S(Re(Pu, u) + Re(PAu, Au))/2. 
The calculus gives 
(22.2.6) [P, A]=))(A,P+ Aj P/)+Ao 
where Aj, Aj and A, are of order 0. Hence 
Re(PAu, Au)=Re(APu, Au)+ Re([P, A] u, Au) 
SC(||Pul] +¥ | Ppul] +¥ Pew + lull) al 


which proves (22.2.5) for P”. 
Before the general inductive proof we observe that (22.2.5) implies 


(22.2.5) |Bq,Aul.SC (Pull + lull),  uweCo(A), 


s+order A+order B<e—1. In fact, the order of B[q,,A] is at most equal to 
é—1~s<—s, and by (22.2.5) the desired estimate is obviously valid for 
|| BA q,ull,,). It is also useful to note that 


(22.2.7) ||BPAul|+ |BLP, AlullySC(|Pull+ |lul), uweCo(k), 


if s+order A+ order B<0. For the commutator this is clear since we have 
an identity of the form (22.2.6) with Aj, Aj, Ao of the same order as A. Since 
||BAPul|,,, has an estimate of the desired form, this proves (22.2.7). In the 
commutator term we may replace P by P’ since B [P”, A] is of order < —s. 

Assume now that (22.2.5) is valid for a certain value of k. To prove 
(22.2.5) for higher values of k we must show that if geQ, then 


C4, 4a) Hl (2-1) SCM Pull + lull), weCo (kK). 
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This means that we must estimate 


(La, 4x] u, Au) =(4, u, qAu) ~ (q u, q, Au) 


where A=E2,_,[4, q,] 1s of order e—1. (We may assume q,q, self-adjoint 
since only the principal symbol is important.) But 


Ndi tllee—1y EAU ey 2) + Ha ello, la Aull oy S C(I Pull + lah)? 
by (22.2.5), so [q,q,]u has the desired estimate. 
Finally we shall prove that (22.2.5) implies that 
IP’, de] Uli ea ty SS C(/Pull + lull), we Co(K). 
This means that we must estimate 
({P",4,] u, Au)=i((P* — P’')q,u, Au)+i(q,{P —P')u, Au) 
where A is of order ¢/2—1. The term 
(q,4, PAu) S 4, Ulle_iy |PAull i _2 


has an estimate of the required form by the assumed estimate (22.2.5) and 

(22.2.7), and so has the term (Pu,q,Au). What remains is to estimate 

(q,u, P’ Au) and (P’u,qg,Au). We may commute P’ and A in the first term, for 
We Ullce— 1) || LP’, A] ules —e) 


has the appropriate estimate since 1 — ¢ + orderA < 0. Now an application of 
the Cauchy-Schwarz inequality as in the estimate of P” above shows that 
we only have to establish an estimate by (||Pu|| + |u|)? for 
(P’ q, Au, q,Au) = Re(Pq,Au,q,Aw), 
(P’ A* q,u, A* g,u) = Re(PA* q,u, A* q,u). 
These estimates follow since by the inductive hypothesis (22.2.5) and by 
(22.2.7) we have 
|Pq,Aul\._.j2)+ ||P.A* Fe Ull— esa) + 4, AUll ea) + || A* GU ll ces2) 
SC(||Pull+ lull), weCo(K). 
The proof is complete. 
The hypothesis that the vector fields (22.2.2) have rank n at every point 
means that the operators L,, [L,,,L,,], [L;,,1£;,.L;,]],--- have no charac- 


teristic point in common. In the following lemma we use a generalized 
version of this assumption: 


Lemma 22.2.4. Assume that for some integer N the operators in Q,U...UQn 
have no characteristic point in common. If eS2-% it follows then that for 
every compact set K< X and every seR 


(22.2.8) lullcr 2) FD IPullsry td [Palhcee 
SC, x(lPull+ luli),  weCo(K). 
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Proof. Choose q,,...,¢;€Q,U...UQ, with no common characteristic point 
over K. By Lemma 22.2.3 


F hay D) alae S CUlPul + bul ue C3 (K), 
hence 
(22.2.9) lull (oe SCC Pullt iui), weCo(K). 
By hypothesis P, and P/ are in Q,; hence we have by (22.2.3) for every B>0 
(2.2.10) Vi(Ppull + Piull)S CB Pull, +B-* lull + lull), weCo(K). 


Since we have such estimates for every K we may replace u by E,u in 
(22.2.9) and by E,,,u in (22.2.10). Since the symbol of E, modS~®” 
is (1 +|é|7)/7, we obtain 
[lie l|cs42ey S [Esuel|aey + Clu lle, 
[Pie llisrey S ||PjEs+eull@ + Clu |leore 
and similarly for P/, for [P;, Es+e] and [P!, E,4.] are of order s +¢. The calculus 
gives 
E_,PE 


& S+E 


=E,P+i(s+e))\D,E,_,P,+R, 
where R is of order s. Hence 

IPE... SllPullyt+ Cd Pull) + Cll. 
Combining these estimates we now obtain for sufficiently large B 


(2.2.11) lll s2ey to WP Ulleseey td PP ull cre 
SC(] Pull +d Pll 7D IPP ull + Mallee) 


Now it follows from Hélder’s inequality that the H,, norms are logarithmi- 
cally convex functions of s, so we have for example 


IF lis S IFS UPS SOF leg t OPP leap» 5 >0. 


If we estimate C||P,ulj,,. in (22.2.11) by ||Piuli.,/2+C lull and Cllul..,.) 
by lulliss25/2+C' lull.) in (22.2.11), we obtain (22.2.8). 


Lemma 22.2.5. Let the assumptions in Lemma 22.2.4 be fulfilled. If Pu 


=feEH& at (xo,€)€T*(X)X0, it follows then that 


(22.2.12) ucH? ., PucH'?, PiucH'®. at (x9,Eo): 


Proof. We may assume in the proof that 


(22.2.13) ucH. PucHy, PilueHy at (Xo, 60). 
In fact, this hypothesis is certainly fulfilled with s replaced by s—Ne if N is 
large enough. If the statement is proved under this additional hypothesis we 


can therefore successively conclude that (22.2.13) is valid with s replaced by 
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s—ke, k=N, N—-1,...,0. We may also assume that ueé’(X) and can then 
modify P outside a neighborhood K of suppu, where (22.2.8) remains valid, 
so that PeOpS?(R"xR"). Choose weS°® with 1/yeS° in a conic neigh- 
borhood of (xg, &,) so that y is of order — oo outside a conic neighborhood 
of (x9,é9) where (22.2.13) is fulfilled. We can choose w(x, D) properly sup- 
ported, in fact so that supp w(x, D)vcK for all v with support near supp wu. 
Choose ye Cf (IR") with 4(0)=1 and set 


W(x, V=W(xC)Z(60), P= W(x, D). 


Then y, is bounded in S° and Y¥,ueC%(K) for small 6. It is clear that 
|'%; ull.) is uniformly bounded when 6-0. Since 


P¥,u=¥,Pu—-i) W,,(x,D)Piut+iy Wi(x, D) Pu—R,(x, D)u 


where R;, Wj, and yj are bounded in S° and of order —co outside a cone 
where (22.2.13) holds, we conclude when (22.2.8) is applied to ¥,u that 


Ms wlhess rey + NP Meallicrey tL IP Pe elhecr ny 
is bounded when 6-0. This proves (22.2.12). 


With Lemma 22.2.5 we have completed the proof of Theorem 22.2.1 and 
even proved that PueH,, at (x,¢) implies ueH,,, 5, at (x,€) for some e>0. 
We shall now examine the more general results which are actually con- 
tained in the preceding lemmas. Let X be a C® manifold and P a properly 
supported pseudo-differential operator in ¥”"(X) such that the refined prin- 
cipal symbol o(P) (cf. Theorem 18.1.33) satisfies 


(22.2.14) Imo(P)eS"-!,  Reo(P)+r20 


for some reS”~?, If X CIR" and p(x, é) is the full symbol of P this means 
that 


(22.2.14y ImpeS"-', Rep+r20 


for some reS”~*. For the sake of simplicity we assume X CIR” in what 
follows, but since the conclusions will be local they are valid for a manifold. 
As before we denote by E, a properly supported pseudo-differential opera- 
tor with symbol (1+|¢|7)”?. Then ueH,,, is equivalent to E,ueH,,_, and 
E,P satisfies (22.2.14) with m replaced by m+s since the real part of the 
symbol is (1+ |é|*)/? Rep(x,é) mod S™*~?. Taking s=2—m we reduce the 
study to the case of operators with m=2 which we assume in what follows. 
By the Fefferman-Phong inequality (Corollary 18.6.11) we have then for 
every Kc X 
Re(Pu,u)=—C,x|lull?, uweC(K). 


More generally, let Q, be the set of all properly supported ge¥'(X) with 
real principal symbol such that for every KE X 


(22.2.15) la(x, |? SC, Rep(x,é)+ Cy, xeK. 
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If we apply the Fefferman-Phong inequality to P—6q(x, D)* q(x,D) with 
6C,<1 it follows that (22.2.3) is valid for every qgeQ,, for the real part of 
the symbol of q(x, D)* q(x, D) is |q(x, 6)|* mod S°, and the imaginary part is in 
Ss, 

The inequality (22.2.15) is satisfied by the operators P, and Pi with 
principal symbol (1+1é?)~* pp(x,é) and p(x, é) respectively. In fact, if 
R21 then the derivatives of order <2 of 


Salx, 6) = Re p(x, R&R? 


have a bound independent of R when $<|é|<2 and x is in a neighborhood 
of K. Adding a constant to p we may assume that f,20. Then Lemma 7.7.2 
gives 


la fa(x, 6)/A(x, MIPS Cha 6), [l= 1. 


Since Im peS? this proves (22.2.15) for the principal symbols of P, and P/, 
From Lemmas 22.2.3 to 22.2.5 we now obtain: 


Theorem 22.2.6. Let Pe¥"(X) be properly supported and assume that the 
refined principal symbol satisfies (22.2.14) for some reS"~*, in a conic neigh- 
borhood of (X9,€o)ET*(X)\0. Let Q, be the set of all Qe¥"~' with real 
principal symbol q such that in a conic neighborhood of (Xo, &o) 


la(x, 7 SC’ Re p(x, Z)(1+1E/7)"— 7 +O" + 1E)7"—7, 


and assume that (Xo, € ) is non-characteristic with respect to some commutator 
of v, factors in Q, and v, factors P—P* with vy, +2v,<N. If e=2~% it follows 
that 


(22.2.16) ueGD(X),PueH(y at (xo,&>) implies ue H's at (Xo, &o). 


(s+m—2+4 26) 


Note that a commutator [9;(x, D), [go(x, D),{...,qx(x,D)]...]] of k operators 
of order m — 1 is an operator of order k(m — 1) —k +1 with principal symbol 


{aix, €), {90(x, 8), {---, u(x, OF -. FRE. 


The hypothesis is therefore that some of these repeated Poisson brackets of 
q; satisfying (22.2.15) or g;=Im p;,_, is invertible in a conic neighborhood of 
(X9,€o). We have stated the result with local conditions on the symbol of P. 
This improvement is immediately obtained by adding to P an operator with 
symbol ||" a(x, €) where 0<aeS° is 0 in a conic neighborhood of (xg, €9) 
and is positive outside another such neighborhood where the hypotheses are 
applicable. 


22.3. Melin’s Inequality 


The inequality of Fefferman and Phong played an important role in the 
proof of Theorem 22.2.6. It is a very precise result in the sense that for self- 
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adjoint operators p(x,D) of. order >2 it may happen that p(x, D) is not 
bounded from below in LV’ although Rep(x,é) is bounded from below. 
However, the result is weak in the sense that an operator p(x, D) of order 
two may be bounded from below in I? although the symbol tends to — co 
as fast as |é| in some directions. The reason for this is that the harmonic 
oscillator D? +x? on R has the lower bound 1 because 


((D* +x*)u,u)=|\(D—ix)ul|? + |uil?,  weCPdR). 


We shall prove some lower bounds for pseudo-differential operators with 
polyhomogeneous symbols by imitating this identity. As an application this 
will give a supplement to Theorem 22.2.6. We begin with an algebraic 
lemma. 


Lemma 22.3.1. Let Q be a positive semi-definite quadratic form in IR?" and V, 
the space of generalized eigenvectors in C*" with eigenvalue A of the Hamilton 
map F of Q. Choose a unitary basis v4,...,v, for V*=@ V,, with the 


u>od 
hermitian form Q(d,v)/2, and an orthogonal basis v,,1,.--,0,,, for the real 


part of Vo/KerF with the quadratic form induced by Q there. If L,(x,€) 
=Q(v,, (x, ¢)) it follows that 


(22.3.1) (Q(x, D) + Q(x, D)*)/2=¥ L(x, D)* L(x, D)+Tr* Q, 
or equivalently 


(22.3.2) DIL S|? = Ax, 4), 
¥ {ReL,,ImL}+Tr*Q=0, (x, eR”. 


k 
Here Tr* Q=) yu, where ip; are the eigenvalues of F restricted to V*. 
1 


Proof. The symbol of L,(x, D)* is L,(x,¢) so that of the self-adjoint operator 
Y L(x, D)* L(x, D) is 


DIL, (x, 6)? +i" D) OL ,(x, 6/86, AL (x, 6)/0x,. 


The symbol of (Q(x, D)+ Q(x, D)*)/2 is 


Q(x, €) + ¥ 6? Q/dx, d€,/2i. 


Taking the real part of the symbols we find that (22.3.1) implies (22.3.2); the 
converse is clear since two self-adjoint operators cannot differ by a purely 
imaginary constant +0. 

The condition (22.3.2) is clearly symplectically invariant, and (22.3.1) is 
obviously invariant under unitary transformations of v,,...,v, and orthogo- 
nal transformations of v,,,,...,0,- By Theorem 21.5.3 it is therefore suf- 
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ficient to verify one of the conditions (22.3.1), (22.3.2) when 


k+l 


k 
O(x, =P ulxft+oj)+ Y x}; 
7 Ke 
vj=pj*(e,t+ie), LSjSk;  v,=e,, k<jSk4l. 


Here e, and ¢, are unit vectors along the x, and ¢, axes. (22.3.2) is then 
obvious for Q(v,, (x, 8) =W(xj+ ie), ISK OO), 0%) =x), k<jSk+L. 


For operators with polyhomogeneous symbol we can now give a sub- 
stantial improvement of the sharp Garding inequality (Theorem 18.1.14). 
Theorem 22.3.2. Assume that Pe¥3"(X, Q*) is properly supported and self-ad- 
joint. Let p,,, and p’,,_, be the principal and subprincipal symbols and assume 
that 


(22.3.3) Pam(%2)20 in T*(X)NO, 
(22.3.4) phy. 2)+Tr* Q, 20 if (%2ET*(X)\ 0, pap (x, €)=0. 


Here Q,. is the Hessian of P,/2 at (x,¢). Finally we assume that the 
characteristic set X = {(x, )eT*(X) 0, p,,,(x,€)=0} is a C® manifold, that 
T,, (2) is the radical of Q,. when (x,é)eX and that the symplectic form has 
constant rank on X. For every compact set K <X one can then find a constant 
Cx such that 


(22.3.5) (Pu,u)>—Cx lull2,-, we C2(K). 


Before the proof we make some observations on the statement. That P is 
self-adjoint implies that p,,, and p5,,_, are real valued. The condition (22.3.3) 
implies that dp,,,=0 on 2, so the Hessian is invariantly defined on JZ. It is 
automatically semi-definite and vanishes on T, .(2). The assumption on the 
radical is therefore that Q,. is positive definite in a plane transversal to 
T,, (2); the condition is often referred to as transversal ellipticity. 


Proof of Theorem 22.3.2. Let E, be a properly supported operator with 
principal symbol (1 +|é|)’’? (defined with respect to some Riemannian met- 
ric). Then it is sufficient to prove that 


(PE, _,U,E,_,W2—Cxllull, ueCP(K), 


for this gives (22.3.5) when u is replaced by Au where A is a parametrix of 
E,_». The principal symbol of E*_,,PE,_,, is |&|?~?"p2,(x,é) and the 
subprincipal symbol is |é|*~?” p§,,_ ,(x, ¢) since it must be real valued. The 
hypotheses in the theorem are therefore fulfilled by Ej_,,PEi-m, so we may 
assume that m=1 in the proof. 

We can apply Lemma 22.3.1 to the quadratic form Q, . in T, (T*(X)), 
(x, ¢)eX. The radical is T, (2) and the rank of the symplectic form there is 
constant. In the notation of Lemma 22.3.1 this means that 2n—2k-—I and 
2(n—k-—l) are constant, so k and / are constant. It follows that the spaces 
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V* and VW vary smoothly with (x, eX, although this is of course not true 
for the individual spaces V,,. For all (x,é)e2 in a neighborhood of any 
given (X9,é,)eX we can then choose C®” complex tangent vectors 
v,(x, €), ...,v,(x, €) forming a unitary basis for V* and real tangent vectors 
Up 4 1(%, Os ---5P4,(%,¢) forming an orthogonal basis for the real part of 
Vmod T, .(2). Then 


Lv)=Q,,(vj,0), veT(T*(X)) 


is a (complex) cotangent vector of T*(X) at (x,é) which is conormal 
to 2. With A, -=(ReL,, ImL,,...,ReL,, ImL,,L,,1,---,L,4)) we have by 
(22.3.2) 


2k+1 


(22.3.6) 0, v= ¥ A, z (v)’, veT,, (T*(X)). 
r 


Now the Morse lemma (Lemma C.6.1) shows that near (x9,€)) we can 
write 


2k+l 
(22.3.7) prlt,6) = bx, &? 


where b,eC®. Indeed, if we choose local coordinates y,z so that 2 is defined 
by z=0 then 6?p,/dz’ is non-singular by hypothesis so the lemma shows 
that p, is a quadratic form in z for some other local coordinates. If we 
restrict b, to the surface |¢|=|¢,| and then extend to a homogeneous func- 
tion of degree 1 we still have (22.3.7), b;eC™, and b,; is homogeneous of 
degree 1. : 

On 2 we have Q, .=)'db?, hence 


Ay gi =) 0; ;(x, €)db (x, ¢) 
j 


where O is a C™ function and the matrix (O,,) is orthogonal. We restrict 0 
to |€|=|€)| and then extend O to a C®™ function of (x, €), homogeneous of 
degree 0, in a full conic neighborhood of (x9, &9). Set 


C(x, €)= YO, é) b (x, é). 
Then 


(22.3.7) p2(x, => ¢,(x, £), 


and dc;(x, €)=A in a neighborhood of (xg, €)) on X when |€|=|é,|. Set 


x,8,% 
X (x, C)=C2;_ 4 (%, ) Fic (x, ¢), jal... & 
X,(x, =e, (066), fekti,..k+h 


and let Xj Ping X) be properly supported with this principal symbol in a 


conic neighborhood of (x9, €,). Then 
Q=P— > XF Xx; 
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is self-adjoint and of first order in a conic neighborhood of (x9, €,). The 
principal symbol q, is real there. On Y we have 


i (x, S)=py(x, €)—-i-* YO (x, 2/0E, 0X (x, Q/Ox,, 


and since q, is real we obtain by taking the real part 
k 
q, (x, €)=Rep,(x, 6+) {Im X,, Re X}} =p} (x, )+Tr* Q, . 
1 


when (x, €)eX is close to (xp, &,). The equality follows from the second part 
of (22.3.2). 

The hypothesis (22.3.4) is that q,20 on 2. We can therefore choose a 
homogeneous function g,, of degree 1, which is non-negative everywhere 
and is equal to q, on 2 in a neighborhood of (x9, €9). Then 


G1 (X, = Gy (x, + DY rye, 6) Xj(x, + DV (x, 6) Xx, 8) 


in a neighborhood of (xg, &9), where r; is homogeneous of degree 0. Choose 
0 and R, with principal symbols 4, and r;. Then the symbol of 


P—SX}X;—-O—YRFX,—-YXFR, 
is of order 0 in a conic neighborhood of (x9, €5) so this is also true for that 
of 
P—)(X,+R))*(X;+R)-Q. 
If weS° has support in a sufficiently small conic neighborhood of (xo, &9) it 
follows that we have a microlocal form of (22.3.5) 


(22.3.5) (Py (x, D) u, W(x, D) uj2 —C ilu], ue Co (K), 


for the operator Q is bounded from below by Theorem 18.1.14. The same 
result is quite obvious if y is supported in a smal] neighborhood of a non- 
characteristic point (x, €,). In fact, if Qe '¥},, has symbol q, dors . then P 
—Q*Q is of order 0 in a conic neighborhood of (xo, €o) if q?=p, and 24140 
=p* there, that is, q,=p3 and q,=p{ p;‘!*/2. 

To complete the proof of (22.3.5) we choose real valued wp ,(x, é)eS° with 
so small support that (22.3.5) is valid for each wy, and 


J 
Vv (x, O?=1 for x near K. 
1 


Then we have, with ¥,=w (x, D) assumed properly supported, 
(Pu, u)=)(P¥,u, Pu) + (1 — FY) Pu, u)t+ (LY, Plu, ¥u)). 


Here [¥,, P] is a first order operator with purely imaginary principal sym- 
bol so the self adjoint part of ¥*[%,, P] is of order 0. Furthermore, Diese ¥, 
—1 is of order —1 ina neighborhood of K, self-adjoint and with purely 
imaginary symbol of order —1. Hence this difference is actually of order 
— 2, so (22.3.5) follows from (22.3.5). The proof is complete. 
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If the inequality (22.3.4) is assumed strict, the other conditions in Theo- 
rem 22.3.2 can be relaxed and we obtain Melin’s inequality: 


Theorem 22.3.3. Assume that PeW2"(X) is properly supported and self-ad- 


joint. Let py, and p>, ., be the eaieipal and subprincipal symbols, and 
assume that (22.3.3) is fulfilled and that 

(22.3.4) pim_1(*, 6) + Tr*Qx,¢ > 0 if %,O ET*(X)\O,  pom(x, 6) =0. 
For every compact set K C X one can then find cx > 0 and Cx such that 
(22.3.8) (Pu, u) 2 cx lelln—1/2) — Cxllelm—a 4 © COCK). 


Proof. Exactly as in the proof of Theorem 22.3.2 we can reduce to the case 
m=1 and also show that (22.3.8) follows from analogous microlocal es- 
timates similar to (22.3.5). Let (x9,&,) be a characteristic point. We can 
choose local coordinates y,z there such that Q,, ., is equal to |z|?. Then the 
Morse lemma (Lemma C.6.2) shows that in a conic neighborhood of (x9, €,) 


P2(x, 6) 2 y b,(x, ey 
Here b, is homogeneous of degree 1 and }'db}=Q 
also choose 

c,(x, {=F 0,;b (x, ), 


where O is an orthogonal matrix, so that dc,(x, €)=A, . ; at (Xo, 9), 


at (Xo, ). We can 


xo, 60 


k 
Trt Q.,,e0= > (Cap C2j- 1} (Xo, Eo) 
I 


Here we have used the notation in the proof of Theorem 22.3.2 and also the 
fact that the arguments there are valid for the fixed point (x9, € ). 
Now we can split the symbol of P as follows, 


3 
P2(x, 6) + Pi (x, )=Las ¢), 
q'(x,€) = p2(x, €) _ > (x, 8), 
j 


S2k+1 
k 


q(x, €)= 2; e,(x, €)? — D, {¢aj€2j-1}, 
I j 


jS2k+ j= 


k 
g(x, = Py {0 j,C2j-1} + Pi &). 
J= 


We have q'20 in a conic neighborhood of (x9, & ) and q’ vanishes of third 
order at (x, €o). Hence it follows from Theorem 18.1.14 and the first remark 
following its proof that we can find g*eS' with principal symbol vanishing 
at (x9, é 9) such that if weS° has support in a small conic neighborhood of 
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(x9, ) and ¥ =w(x, D) then 
Re(q'(x, D) Wu, Pu) = Re(q*(x, D) Pu, Pu)—Cx|lull?, we Co(K). 


(We could have taken q*eSone by the Fefferman-Phong inequality but this 
precision is not required here.) With X ES ng properly supported and with 


principal symbol c,;_, +ic.;, j Sk, and c,, ;, j>k, we have 
Re (q*(x, D)u, u)—S'(XF X;u, uz — Cx lull’. 


In fact, the self-adjoint operator (q7(x, D)+q7(x, D)*)/2—).X}? X, is of order 
0 since the principal symbol is 0 and the first order symbol is purely 
imaginary. (These calculations are much more transparent if one uses the 
Weyl calculus but we have not wanted to require this background here.) 
Finally the real part of the principal symbol of g*+q° is positive at (x9, &), 
hence 


Re ((q*+4°) (x, D) Pu, Pu)2ex||Pull4,—Cx lull, weCo(K) 


if ex is sufficiently small. Summing up the preceding estimates we have 
proved the desired local version of (22.3.8). 


Remark. In Section 22.4 we shall prove that conversely (22.3.8) implies 
Pom=O and (22.3.4)’. 


As an application we shall now give an improvement of Theorem 22.2.6 
at points where Tr* Q, .>0. (See Proposition 22.4.1.) 


Theorem 22.3.4. Assume that Pe¥},,(X) is properly supported, and that the 
principal symbol p,, is non-negative in a conic neighborhood of (Xo, €)é 
T*(X)NO but vanishes at (Xo, &>). Let Q be the Hessian of p,,/2 at (Xo, &o), 
and assume that the subprincipal symbol p*,_ , satisfies 

(22.3.9) P51 (Xp, Eo) + Tr* O¢IR_ = {teRR, <0}. 


Then ueQ'(X), PueH,, at (Xo, Eo) implies that ue H,,, m— 1, at (Xo, €o)- 


Proof. As usual we may assume m=2. The hypothesis (22.3.9) means that we 
can choose a complex number z with 


(22.3.10) Rez>0, Rez(pi (xo, 9) +Tr* Q)>0. 


Then P=(zP+ZP*)/2 has the principal symbol Rezp,,20 and the sub- 
principal symbol Rezp}. If weS° has support in a small conic neigh- 
borhood of (xo,é>) and ¥=w(x,D) we have then by Theorem 22.3.3 if 
Kex 


| Pulls C’ Re(zPPu, Pu)+ Cxllull?, weCe?(K). 
Hence 


(22.3.11) | Pull SC? |zP Pull? +2Cglull?, we CX(K). 
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Let P/ and P, be properly supported of order 1 with principal symbols 
dp,/dé, and |é|~* ép,/éx, respectively. Then the principal symbol of 
P65 P* P35) P*P 

is non-negative near (x9, ) for small 6, as observed in the proof of Theo- 
rem 22.2.6. For small 6 we can apply Theorem 22.3.3 again and obtain 
(22.312) ) Pi Pull?+) PR Pull?<C Re(zP Pu, Pu)+C’ |u|? 

SC’ |zP Pull) |Pullgyt C” lull?, we CHK). 
(22.3.11), (22.3.12) are a microlocal version of (22.2.9), (22.2.10) with e=4 so 


the proof of the theorem is completed by inspection of the proofs of 
Lemmas 22.2.4 and 22.2.5. 


Not even the condition (22.3.9) is quite optimal. In Section 22.4 we shall 
determine the precise set which p*,_,(x9,€)) must avoid for the conclusion 
of the theorem to be valid. 


22.4. Hypoellipticity with Loss of One Derivative 


In this section we shall determine necessary and sufficient conditions on P 
for the conclusion of Theorem 22.3.4 to be valid when the condition 
Rep,, 20 is relaxed to 


(22.4.1) P(X, EL = {zeC; |Imz|<y Rez}. 


(Any other convex angle with opening <z could be used, but this is a 
convenient choice.) However, we shall first prove the converse of Theorem 
22.3.3 which we postponed since it is technically closer to the arguments in 
this section. 


Proposition 22.4.1. Assume that Pe'¥y,(X) is self-adjoint and that for every 
compact set Kc X 

(22.4.2) (Pu,u)2 —Cxllullgya—1, 4eCe (K). 

Then it follows that the principal symbol p,, is non-negative and that 


(22.4.3) ph_1(x, €)+Tr* Q, -20 if (x, Ge T*(X)\ 0 and p,,(x, €)=0. 


Here p;,_ (x, €) is the subprincipal symbol and Q, . is the Hessian of p,,/2 
at (x, €). 


Proof. It suffices to prove (22.4.3) when X CIR", x=0 and 0 is an interior 
point of K. Choose yeC?(X) equal to 1 in a neighborhood of K and 
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supp Pu for every ueCf(K). Then Pu=yPyu=p(x, D)u, ueC%(K), where 
peste and P~Py + Pmit-.. in K. We shall apply (22.4.2) to 


phg 
(22.4.4) u(yy=el Wyle|/), 
where weC¢(IR"). Then we have ueC¢(K) when |é| is large. Since ii(y) 
=W((n— SA?) 12|-""? we have 
(22.4.5) [ElP"—7* ull = lel 2-792)" f (1+ In? (in — OEP? dn 
=(2m)~" J VEL? +1EAS + nElAV7Y Wh (M7 dn Ili? 


when €->00. An elementary calculation gives 


(22.4.6) p(y, D)u(y)=e'? b(ylél*), 
(22.4.7) by)=(20) "fe p(yAEl®, €+nlel*) Wn) dn. 


By Taylor’s formula we have 
PONE? E+? = Yelle! !9P/2 pO, &) yon*/a! BI+ R.(y, 1), 


la+pjs2 


IRe(y, mS Cyl +lyl)31l"—>?, In <1E|2/2, 
IRQ, als CA +lyl+ ty)? + inf)? '™l, In| > |él4/2. 
In the first case this follows since |€+|€|?|>|é|/2 if |n|<|é]?/2 and in the 


second case we have estimated cach term separately and used that 
E+ |6|*|<6|n|?. Hence 


(22.4.8) Iey— LEP? DO, ) y? D*yi/at! BI 


ja+ Bi s2 
$C +ly)* er, 
which by dominated convergence implies that 
[E"—" (Pu, u) =[El-"(bz, W) =p, (0, E)1E1-™ wll? + O(WE|-). 
In view of (22.4.5) it follows from (22.4.2) that p,,(0,¢)20. If p,,(0,€))=0 it 


follows that p®’,(0, €))=0, |«+6|=1, and taking ¢=t¢, we obtain 


Je" *!—™(Pu, u)=|e'-"(bz, W) 
=((Q(y, D)+ Pm —1(0, Eo) WW + OVE *) 

where 0=Q, .,. Hence (22.4.2) implies 

(Q(y, D) + Pm_10, Fo) ¥ 20, pece. 
It is now convenient to switch to the Weyl calculus, noting that 

Q(y, D) + Pn 1(0, Fo) =O"(y, D) + Pm 10, Fo) 
since Q”(y, D) is obtained from Q{y,D) when y,D, is replaced by (y,D; 
+D,y))/2=y,D,—i/2. With the notation c=p;,_ ,(0,¢o) we have 


(22.4.9) (O°, DW W+ceW.w20, per 
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for C> is dense in “ We now use the important property of the Weyl 
calculus that such inequalities are invariant under composition of Q with 
linear symplectic maps (Theorem 18.5.9). To analyze (22.4.9) we may there- 
fore assume that Q has the form (21.5.3), 


k+l 


Oly, => HiV;j +n7)+ 2 Vi: 


The inequality (22.4.9) can then be written in the form 
k k+l k 
Yaylloy+iD IW? + Y lw? + (c+¥ ay) WIPZO, 
i k+1 1 
k 
as in Lemma 22.3.1. If we recall that Tr*Q=) #, and take 
1 


k 
WY) =e 78? x Vas s/€ «+5 Yulee, 
1 


it follows when e—0 that 
(c+Tr* Q) |x|? 20. 


Hence c+Tr* Q20 as claimed. 


The proof of Proposition 22.4.1 also leads to necessary conditions for an 
operator to be hypoelliptic with loss of one derivative as in Theorem 
22.3.4, First we must prove a lemma: 


Lemma 22.4.2. Let Pe'¥"(X) be properly supported, X <IR", and assume that 
ueé'(X) and PucH,, implies ue H,,, 1). For every compact set K<X one 
can then find Cx so that 


(22.4.10) lel s4m—1)S CaP ull + lellis+m—2ys 4ECo (K). 


Proof. The set of all ueH,,,,..9€(K) with PueH,, is a Banach space 
with the norm in the right-hand side of (22.4.10). Since it is embedded in 
€(K)OA,.4m—1) and the embedding is closed, the statement follows from 
the closed graph theorem. 


Let E, be a properly supported pseudo-differential operator with symbol 
(1+|é|?)"?. If we replace u by E_,u in (22.4.10) we conclude that (22.4.10) is 
equivalent to the same estimate with P replaced by E,PE_,, and s replaced 
by 0. If the symbol of P is p,,+p,,_;+.-.. then that of E,PE_, is 


Pm +Pm—1—islél“? ¥° €, Op,,/Oxj+... 


so the first two terms are not changed where dp,,/0x=0. Keeping this in 
mind we assume s=0 in the following proposition. 


—s 
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Proposition 22.4.3. Let Pe P(X) be properly supported and assume that for 
every compact set K< X cR" 


(22.4.10Y tlh n—1yS Cx(Pullcoy+ lullom—2y), ue Cet). 


Let Q(y,n) be any limit of (y,n)op,,(x+ ylel74, E+ ne?) [El -™ as x x° eX, 
E+ 00 and /|é|>€°. Then we have piipy(x®, €°)=0, la+ P| <2, Q is a quadrat- 
ic polynomial with principal part 
Y pg (x®, €°) y? n/a! B! 
ja+pl=2 
and it follows that 


(22.4.11) Jlwi?dySCJlQ”(y, D)W+pi_i0° 6) wiedy, pecP(R’). 
Here C is bounded by the constant Cx in (22.410) if x° is in the interior of 
K. One can replace Q by the homogeneous polynomial yo A(y/Yo, n/YVo). 


Proof. Let x, >x°, E/|E,|-> €° be a sequence giving the limit Q, and assume 
that x° is in the interior of K. We shall apply (22.4.10)’ to 


(22.4.4)' u(y +x) =e? W(ylé,I4) 

where weC?(R"), which is just a translation of (22.4.4). Then we have 
(22.4.5) gym? 28 lly Hey wl’, 

(22.4.6) Pu,(y+x,)=e'* o,(y/E,), 


(22.4.7/ — by(y= (2x) fe" p(x, + E18, & + M1607) On) dn. 
We claim that for any N 


(22.4.8) PO DLE a PP? ihc, €,) yo D* /ce! BI 


a+ Bl S2 


SC(t+ly)-™E,"-#. 
When N = —3 this is precisely the estimate (22.4.8). Since 
yb, (y)=(20)-" fe’? (—D, "v(x, + E13, & +1619) Wn) dn, 


the proof of (22.4.8) also gives (22.4.8) with N= —3 and any power y’ in the 
left-hand side. This is equivalent to (22.4.8). By hypothesis and Taylor’s 
formula 


EJ DY JE flat HPI? pith (xy, 6)? 1° 9 O(y, n) 


lat pi s2 
SO PMitpy(Xv» E/E) = OE? 72/2 — 1) 0 if fa] + (| <2, and 
[E,Py? +20 —™ Pu, I= HE By Ileoy 
> lO(y, D) b+ Py 1 0°, EV WI?, v0. 
Hence it follows from (22.4.10) and (22.4.5) that 


IW S Cg lO, DW + Pu 1°, O° WI. 
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As in the proof of Proposition 22.4.1 we have 
Q(y, D)+ Pm —1(%°, €°)=Q"(y, D) + Pin 1 (2°, €°) 
so this proves (22.4.11). If we replace ¢, by t?€, where t>0, then Q(y,n) is 


replaced by t? Q(y/t, n/t). Substitution of y(—y) for w(y) changes the sign of 
t, so this proves the last statement. 


Unfortunately the theorem gives no information on the lower order 
terms of Q. However, they can be determined easily if p,, vanishes precisely to 
the second order on the characteristic set 2 of p,,, that is, if for some 
positive constants C, and C, 


(22.4.12) Cz d(x, 6) Slpalx, YISC, d(x, 6); xeK, |é|=1, 
where d denotes the distance to 2. Indeed, let 

EL P(X + VIE I-73, &, +0161?) > Oy, 0). 
Then [6,1°~" Pm(Xy> Ev) =1Es1 P(X» €/1E,)) is bounded, so (22.4.12) gives 


d(x,, E/E) =O(E1-*). 


Choose (x,, &JeX with |Z,|=|,] so that |x,—x,|=O(/é,|~*) and |é,—&] 
= O(\é,|*). Passing to a subsequence we have 
JEL palx, + VEL 4, + ME) > Qoly, 2) 
where Q, is a homogeneous polynomial. If (x,—x‘){éJ?>y, and 
(€,-€)1E,1- 4 > my, it follows that Q(y,n)=Qo(yt+y1.n+). 
Now a translation of w or the Fourier transform w of y shows that a 
real translation of the polynomial Q does not affect the validity of (22.4.11). 


When (22.4.12) is valid it follows that the condition in Proposition 22.4.3 is 
not changed if we take the lower order terms of Q to be 0. 


Our next goal is to determine when an estimate of the form (22.4.11) is 
valid for some C and then to determine when the same constant can be 
used for a family of (Q,«), «=p%,_,(x°, €°). In doing so we shall keep in 
mind that Q is obtained from p,, as described in Proposition 22.4.3, which 
implies that 


(22.4.1) Q(x, Hel, (x, GeIR™, 
if p,, satisfies (22.4.1). At first we even assume that ReQ has a positive 
definite principal part. 


Lemma 22.4.4. Let Q be a complex valued quadratic polynomial in IR?" with 
principal part Q, such that ReQ, is positive definite. Let p,,...,u, be the 
eigenvalues of the Hamilton map of Q./i in the half plane ReA>O0, and let c 
be the value of Q at the unique point (z, €)eC*" where dQ(z, £)=0. Then 


(22.4.13) Ip sCllOr, D)W,ver 
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holds for some C if and only if 


(22.4.14) c+)'(20,+1)u;+0, OSa,eZ, 
1 
and then we also have for some other constant C' 
(22.4.13)' Y Iy’ Dewi SC’ lO", D) vl, veF 
la+pis2 


Proof. Let B be the Hilbert space of all wel? with y’D* wel’, |a+ Bi <2, 


and the norm 
( iy? Dey). 


ja+p|S2 
The operator Q” is a Fredholm operator from B to L’(IR"). In fact, e(x, €) 


= Q(x, )/(1+/Q(x, |?) is a symbol of weight (1+]|x|+]é))~? with respect to 
the metric 


(ldx|? +|d€|?)/(1 + |x|? +1€17) 
so e”(x, D) is a continuous map from L? to B. We have 

ev OV=I+RY, QO" e"=14+RE, 
where R, and R, also have weight (1+|x|/+{é])~*, so RY and R¥ are 
compact in L? and in B, which proves the Fredholm property. Also note 
that Q”u=0 implies u= —RYu=(—RY)\u for every N, hence ueS If ves” 
is orthogonal to Q” ¥ then v= — R¥v, hence ve X Since 

(x, €)r+1O(x, 2) + (1-2) (1 + [x17 +121?) 


satisfies the hypotheses in the lemma for 0<t<1 it follows that the index is 
the same as for the operator |x|?+|D|?+1 which is injective on Y with 
range dense in I? hence of index 0 as operator from B to L’. The index is 
therefore always 0, so (22.4.13)' is valid if Q”(x, D) is dense in L’. 

In order to study the kernel and cokernel of Q” we use Theorem 21.5.6 
and Corollary 21.5.8 to choose symplectic coordinates such that Q,(x, ix) 
=0. With teC" to be chosen later we set 


E = gi<%st> —|x17/2 
and note that Ee SY and that 

E~'Q”Ev=0"v 
where 


Olx, 2) =O(x, E+ix+t) 
is linear in x. If s is another vector in €” we have if v is an entire function 
5_,*0"(v#5)=R”0 
where v x 6,(x)=v(x—s) and 


R(x, )=QO(x+s, €+i(x+s)+ 0). 
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Now the radical of Q, is {0} (cf. Theorem 21.5.4) so the equation dQ(z, C) 
=0 has a unique solution, characterized by 


O(x+z,6+0)=Q,(x, 2) + Q(z, ¢). 
With s=z, t= (€—iz and the notation c=QO(z, €) we obtain 
RY’ v=Q3(x%, D+ix)vt+ev. 


The fact that Q,(x,ix)=0 means that R” maps polynomials of degree <p 
to polynomials of degree <y, for every yp. Two cases can occur: 
a) R”v=0 for some polynomial v+0. Then 


Q"(E(v#5.))=0 


so (22.4.13) is not valid. 

b) R”v+0 for every polynomial v+0. Then R is surjective on the space 
of polynomials of degree <p. Now the products of E,=Ex6, by poly- 
nomials are dense in I?. In fact, if fel? and 


{ fE,vdx=0 
for every polynomial v, then the derivatives of the entire analytic function 


(fE.) are 0 at 0, so fE,=0, hence f=0. Then (22.4.13)' must be valid. 

What remains is just to show that (22.4.14) is equivalent to injectivity of 
R” on the space of polynomials. Using the polarized form of Q, and the 
Hamilton map F defined by (21.5.1) we have 


Q,(x, E+ix; x, €+ix)=2Q,(0, 3 x; ix)+Q,(0, é) 
=24((0, é), F(x, ix))+ 2, (0, é). 


The space {(x,ix)} is spanned by the generalized eigenvactors of F with 
eigenvalues ip;, so F(x,ix)=(Mx,iMx) where M has the same eigenvalues. 


Hence 
RY” =2<¢Mx, D> +Tr M/i+c4+@,(0, D). 


The first three terms preserve the order of a homogeneous polynomial while 
the third lowers it by two units. Since the eigenvalues of a triangular matrix 
are determined by the diagonal elements it follows that Q,(0,D) does not 
affect the eigenvalues of R” as an operator on the space of polynomials. By 
Lemma C.2.2 in the appendix the eigenvalues of 2(Mx,D)>+TrM/i+c as 
an operator on homogeneous polynomials of degree uw are precisely the ~ 
sums 
2yYa;mjt+TrM/ite, lol=p 

Hence (22.4.14) means precisely that no eigenvalue is equal to 0, which 
completes the proof. 


Proposition 22.4.5. Let Q be a quadratic form in R*" satisfying (22.4.1), 
denote by yu, the eigenvalues in [XO of the Hamilton map of Q/i, and let V, 
be the space of generalized eigenvectors belonging to the eigenvalue 0. Then 


22.4. Hypoellipticity with Loss of One Derivative 373 


the estimate 

(22.4.15) IW SCi(Q"%, D)+K) Yl, weXUR"), 

is valid for some C if and only if 

(22.4.16) #+Q(6,v)+ > Qa;+1)yj; #0 when ve Vo and OSa;€Z. 


Proof. Since ReQ is semi-definite we can use Theorem 21.5.3 to choose 
symplectic coordinates such that 


k+I 


k 
(22:4.17) ReQ=)AF(xf+e7)+ DV x7; A;,>0. 
1 k+1 


In view of Theorem 21.5.4 it follows that Q is then a quadratic form in x’ 
=(X1,...5%,), 6 =(E,,...,6,) and x” =(x,44,---,X_4))) Thus Q”(x, D) only 
involves derivatives with respect to x’ and can be regarded as a differential 
operator in x’ depending on the parameters x”. The estimate (22.4.15) is 
therefore valid if and only if for every fixed x” 


(22.4.15) IWS ClO"(%',x”,D)+0)ul, yweX(R"). 
As observed after Theorem 21.5.4 


Sc=Im F/(Ker FnIm F)= @ V,; 
A+O 


here F is the Hamilton map of Q and Im F=(Ker F)’ is the complex x’ € €” 
plane, Ker FoImF is the &” plane, so (x’, é’) are symplectic coordinates in 
Sc. The map F’ induced by F in Sq is the Hamilton map of Q(x’, 0, €) which 
is therefore a non-degenerate quadratic form. By b) in Theorem 21.5.4 VY, is 
the Q orthogonal space of Im F, that is, V, is defined by d,,..Q(x’, x”, ¢')=0. 
Hence it follows from Lemma 22.4.4 that 


(22.4.15)" Iwi SCO) (O"O, x” D) +H) ul, wer(R, 


if and only if 
| K+Q(2',x",C)+YQa,+1)uj+0, OSajeZ 


when (2', x", C)eEVo. 

Any veV, can be written in the form v=v,+iv, where v,, v,¢V, and 
the x” coordinates are real, for (x’, €’) is a linear function of x”. Hence }—v, 
+iv, has x” coordinates 0, so 


Q(0, v) =Q(v, —iv,, v, +iv,)=Q(v1)+ Q(v2). 


The range of Q(0,v) when veV, is therefore the same convex angle as the 
range of Q(v) when veV, and v is required to have real x” coordinates. This 
proves that (22.4.16) is necessary and sufficient for (22.4.15)” to be valid with 
C(x")<oo for every x”. However, the fact that (22.4.13) is equivalent to 
(22.4.13) shows that the best constant is locally bounded. We can also take 
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C(x”) bounded for large x’. In fact, we have 
Re(Q(x,4)+KH)21_— if [x”|>M. 
This implies 
Iwi?SRe(Q"(, D)+K) YW), weS¥(R"), |x"|>M, 


as follows immediately from Lemma 22.3.1 if we remove the first order 
terms in x’, D’ by a translation of w and of w. Hence 


IW SWQ"(x, D)+K)w il, peX(R%, |x"|>M, 
which completes the proof. 


It is easy to see that the estimate (22.4.15) remains valid with a small 
change of C if « and Q are replaced by x’, Q’ and x—x’, Q—-Q’ are 
sufficiently small, rank Q =rank Q’, and the dimension of the zero eigenspace 
remains fixed. However, small perturbations of Q may increase the rank and 
decrease the zero eigenspace. To obtain explicit conditions for (22.4.11) to 
hold we must therefore study families of estimates (22.4.15). 


Proposition 22.4.6. Let M be a set of pairs (Q,«) where ke€ and Q is a 
quadratic form in R" satisfying (22.4.1). Assume that « and the eigenvalues of 
the Hamilton map of ReQ are uniformly bounded when (Q,«)EM. Then there 
is a constant C such that (22.4.15) is valid for all (Q, «)eM, if and only if there 
exist positive constants &, 6 such that 


(22.4.18) 5Sle+O(6, 01+ ¥ (2a, +1) u)| 


if (Q,x)eM, 0Sa,eZ, pu, are the eigenvalues in !~0 of the Hamilton map of 
Q/i and v is in the space spanned by the generalized eigenvectors of the 
Hamilton map of Q corresponding to eigenvalues of modulus <e. 


In view of the symplectic invariance and Theorem 21.5.3 we may assume 
in the proof that for every (Q,«)e€M the real part of Q is of the form 
(22.4.17). After splitting M into a finite number of sets we can assume that k 
and /| are fixed and that k+J=n. By hypothesis we have a uniform bound 
for 1,. It will be sufficient to show that every sequence (Q,, k,)eM satisfying 


J 
one of the conditions in the proposition has a subsequence satisfying both. 


Write 
k n 
(22.4.7) ReQ, = LANs] Feit 2. xf. 
+ 


Since there is a uniform bound for A;, we may assume that lim A, =A, exists 


for every j. The coordinates can be labelled so that 4,+0 when jn’ andi, 
=0 when j>n’. We shall write 


XH (Ky, Xy by XM HO Xe X= (Ky ses X,)- 
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By hypothesis |ImQ,|<yReQ,, so if we write x’/A, for the vector with 
coordinates x,/Aj,,n'<j<k, then 


iv? 
[Im Q(x’, x”/A,, x", 8 C/A Sy REQ, (x, x°/A,, 0°75 0, E'/A,). 


The right-hand side converges to 
(Daze +a + be? +e +x") 
1 


so the coefficients of Im@Q, are uniformly bounded after this dilation. Pass- 
ing to a subsequence if necessary we obtain 


Q,(x’, x"), xh & é"/2,) =} Ox’, x", x02 ae é”) 


where Q is elliptic in the indicated variables. Note that we have made a 
non-symplectic change of scales here. The justification for this will be given 
by the following lemmas, which also involve the limit 


Q(X, ) = Q(x’, 0, x”, ¢', 0) 


when the scale is not changed. The Hamilton map of Q, has n' non-zero 
eigenvalues in iI’, and the generalized eigenspace belonging to 0 is given by 
(x’, €) =H,,.(x’") where H,, is a linear map. For any convergent sequence of 
linear maps in a finite dimensional vector space, the linear hull of the 
generalized eigenvectors belonging to eigenvalues converging to 0 converges 
to the zero eigenspace of the limit. It follows that the space N, spanned by 
generalized eigenvectors of the Hamilton map of Q, belonging to eigenval- 
ues converging to 0 can for large v be written in the form (x’,é’) 
=H,(x",x", 2”), 


Lemma 22.4.7. H,(x"/1,,x',€"/A,) has a limit A(x’, x’, 6”) as v—-oo, and 
(x, €) = H(x", x’”, €) is equivalent to 


(22.4.19) O(x, & y, n)=0 when yl=y"=n" =n" =0, 
that is, 6Q(x, €)/A(x’, €’)=0. 


(If F and F,, are the Hamilton maps of Q and Q,, this means that F,, F(x, é) 
=0.) 


Proof. Q,(x',x"/A,, x", &,A, €) differs from Q, by a symplectic dilation so the 
space spanned by the generalized eigenvectors with small eigenvalues of the 
corresponding Hamilton map is defined by 


(x’, é)= H,(x"/A,, x A, &"), 


This equation must therefore converge to the equation of the zero general- 
ized eigenspace of the Hamilton map of Q(x, é’,0). Here we may interchange 
the roles of x” and €”, which proves that the asserted limit H exists. More- 
over, from b) in Theorem 21.5.4 it follows that (22.4.19) is valid if (x', €’) 
= H(x",x'",€") and either €” or x” is 0. In view of the linearity the last 
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condition can immediately be removed. Since (22.4.19) determines (x’, &’) 
uniquely in terms of the other variables, the lemma is proved. 


Lemma 22.48. The limit of the range of Q,{0,v) when veEN, is equal to the 
range of Q(0,v) when v=(x, &) and (x’, €)=H(x",x'",€"). This does not change 
if we require x",&”,x’” to be real. 


Proof. The range of Q, in question is the range of 

0, (x, x"/A,, x", &, ErThS x’, Pane x’, ha E"/2,) 
when (x’,€)=H,(x"/A,,x'", €”/2,). The quadratic form converges to Q, the 
restriction on (x, €) converges to (x’, &’) = H(x”,x’”, €”), and Q is elliptic in the 
variables (x’, x”, x’”, €’, €’) which proves the first statement. To prove the last 


one we just have to repeat an argument in the proof of Proposition 22.4.5, 
this time using (22.4.19). This is left for the reader. 


Proposition 22.4.6 is proved in one direction by Lemma 22.4.8 and 


Lemma 22.4.9. Assume that (22.4.15) is fulfilled by the sequence (Q,,,) and 
that k,—>«. With the preceding notation it follows that 


(22.4.16)' K+ Q(G,v) +> (20,4+1) uj;+0 


if u; are the eigenvalues in I of the Hamilton map of Q,,/i, 0Sa,€Z, and v 
= (x, ¢) satisfies (x’, ¢')= H(x", x’”, 6”). 


Proof. The last statement in Lemma 22.4.8 shows that we may assume that 
x", x’, " are real. By a rotation in the x,é, plane for n’<j<k we can 
arrange that €’=0. A symplectic change of scales shows that (22.4.15) is 
equivalent to 


WI SCHIOVOx"/A,, x" DAD) W+K, wl, weS(R", 
where C is independent of v. Letting v->0o’ we conclude that 
IW SCIQO"%, DL Oy+Ky|, yweS(IR". 


The zero eigenspace of the corresponding Hamilton map is defined by (x’, ¢’) 
= A(x”, x’, 0), so (22.4.16)' follows from (22.4.16) when €” =0. 


Remark. All positive multiples of the small eigenvalues which occur in 
(22.4.16) are contained in the range of Q on the corresponding eigenspace. 
Hence (22.4.16)' implies that (22.4.16) is satisfied uniformly by x,,Q, when v 
is large. However, simple examples show that (22.4.16)' is a good deal 
stronger than that. 


When proving the sufficiency of (22.4.16)' we shall regard Q” as a 
(pseudo-)differential operator in x”,x'” with values in operators on functions 
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of x’. To shorten notation we write t=(x",x'’,é"). From the proof of 
Lemma 22.4.4 we know that (22.4.16) implies that Q”(x’, D',t)+x« for every 
real t has an inverse E(t) which is a continuous operator from I? to the 
Hilbert space B of all ue I2(IR") with 


lulp=( > Ix DP ull*)*<oo. 
la+ pj) s2 


It follows that E(t) also exists and is an analytic function of ¢ when t is in 
some small complex neighborhood of IR"**~?". In particular, EeC®. 
Lemma 22.4.10. For all multi-indices a, B, y with \a+ B}| <2 we have 
(22.4.20) (1+ |e/)'+?-ll-18| |? xD? Eu SC, lull, weCZ(R"). 
Proof. It suffices to prove (22.4.20) when |t| is large. Assume first that y=0. 
There is a constant c>0 such that for large t 
e(|x'? +121? +e?) S Re(Q(x’, &, ) +x). 

Hence the end of the proof of Proposition 22.4.5 shows that 

c f(x’? + 1e]?) |v]? + |D’ v|?) dx’ s Re(Q”(x’, D’, t) + x) v, v) 

S|t}-? (Q” +) v||7/2c+e|e/ |[v||7/2, ve X(R"). 


Hence 

c? |t\? f ((1x'|? + (tl?) |v}? + [D’ vf?) dx’ s |(O"(x', D’, +) oll’, 
so we have 
(22.4.20Y YP y'* DF ol SCIO"(x', D.O+K) 0]. 


lat Bl <2 


(22.4.20) follows when y=0 if we combine this result with the estimate 


> |x*D? vl] S$ C(LQ"(x’, D’,0) v|| + llol) 
ja+pl=2 
which follows from Lemma 22.4.4. Since D,E=—E(D,Q)E, we obtain 
(22.4.20) inductively by repeated differentiations. 
It is clear that we may replace Q+xk in (22.4.20) and therefore in 
(22.4.20) by any sufficiently small perturbation which is independent of ¢’”. 
In particular we can for large v replace 0+ by 


Q(x’, x"/h,, bane &"/2,) + K, 
and obtain an operator E,(x”,x’’,é”) such that (22.4.20) is valid uniformly 
and 
(Q”(x’, x"/h,, es D’, E"/A,) + K,) E(x", x5 “" = I. 
Set 
Ei (x’, x”, é) =E,(A, x » a A, é"), 
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For ueCP IR") we denote by a(x’,é’,x’”) the Fourier transform with 
respect to the x” variables and set 


é, u(x) = (2 (3 a fete? E,(x", x", é") ti(x’, ip x) dé". 


It follows from (22.4.20) that the norm of D2... E, as operator in PR") is 
bounded by C’(1+|x”|+|é’))~"", so the vector valued version of Theorem 
18.1.11 proves that &, is uniformly bounded in 7(IR”), A direct computation 
gives 

(Q”(x, D)+k,)é,=I1+,, 
where &, is defined as &, but with E’, replaced by 


Rix", x", 6°) =(OM (x, D’, D" + £") OM (x, DY, £)) By", x", €") 
=R,(A, x”, x", A, é"), 
R(x", xl”, &") = (Q”(x’, JA, x D’, (A2 D” 4+- é’)/2,) 
= QO” (x’, x"/A,, bee D’, &"/A,)) E,. 
It follows immediately from (22.4.20) that 


k 
ID?RyulSC Y [Al + le)" lull, we CPR"). 
n’t+t 


Hence we conclude that the norm of #, as operator in 17(IR") tends to 0 as 
v—oo. Since we know that (Q”+x,) is invertible, the inverse is &,(1+2,)~! 
which has uniformly bounded norm as v-oo. The proof of Proposition 
22.4.6 is now completed in view of Lemmas 22.4.8, 22.4.9 and the remarks 
which followed the statement. 


Proposition 22.4.6 assumes a much simpler form if one has a compact 
family where Q has constant rank. This is the only case which occurs in our 
application when the characteristic set is a manifold and p,, is transversally 
elliptic. ; 


Corollary 22.4.11. Let M be a compact set of pairs (Q,«) of complex numbers 
k and quadratic forms Q in R?" satisfying (22.4.1). Assume that all forms Q 
occurring have the same rank r. Then there is a uniform estimate (22.4.15) 
when (Q,«)eEM if and only if (22.4.16) is valid for each (Q, k)eM. 


Proof. Let Q,-~@Q and assume that all Q, have the same rank as Q. Denote 
by Np the space of generalized eigenvectors of the Hamilton map of Q with 
eigenvalue 0 and by N, the space spanned by generalized eigenvectors of the 
Hamilton map of Q, belonging to eigenvalues +0. By Proposition 22.4.6 it 
suffices to show that the range of Q(0,v) when veN, is the limit of the range 
of 0,(6,v) when veN,, v—oo. It is obvious that dim N,=dim N, for large v, 
and N, resp. N, contains the null spaces Nj and Ng of the Hamilton maps of 
Q, and Q. Since Nj has the same dimension as N; by hypothesis, it follows 
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that N}— Nj. We can choose a linear bijection T,: N, > N, converging to the 
identity as v>oo so that T,N,;=N). Then Q, T, induces a hermitian form 
Qi, on N,/No converging to the elliptic form Q’ induced there by Q. This 
implies that the range of Q) converges to that of Q’, which proves the 
corollary. 


In Proposition 22.4.6 we have given necessary and sufficient conditions 

for the uniform validity of the test estimate (22.4.11). Our next goal is to 
prove a converse of Proposition 22.4.3. In doing so. it is convenient to 
assume m=1, which is no essential restriction. 
Lemma 22.4.12. Let Pe Pong(X) be properly supported with principal symbol 
p, satisfying (22.4.1). Suppose that for every compact set KcX there is a 
constant C such that (22.4.11) is valid for all limits defined in Proposition 
22.4.3 with x°eK. Then there is a function e(€) +0, E00, such that for every 
xeK 


(2.4.21) Iwi dy | 
(C+) {I » Deda (x, EE [2-4/2 y?D* w/a! B! + polx, Cl? dy 
atpls2 
+e(2)) >) ly D*pl\?dy, weCP(R’). 


la+p| <3 


Proof. We just have to show that (22.4.21) is valid for large |é| if «(€) is 
replaced by a fixed positive number «. Assume that this is false. Then there 
exists for some ¢>0 sequences x,€K and €,- 00, w,eCg, such that 


(22.4,22) 1=flp i? dy 
AEN » Poy (jo EME lt FP! y? D2 iJ! BI + po(x;,€) Wil? dy 
a+ pl s2 


t+ef Yo ly? Dt? dy. 
a+ plS3 
It follows that the sequences x*D’y, are precompact in I’ when |«+ B| $2, 
so we may assume that they converge to x*D* yw in I? for some wel’. Since 
(22.4.1) implies by Lemma 7.7.2 that for |j«+f|=1 


IP Stay(ox, ED [eI 9P? < C(Re py (x, 6))*, xe K, 


the sequence p,(x,,¢;) must be uniformly bounded, for the right-hand side of 
(22.4.22) would otherwise be asymptotically equal to |p,(x;,¢ pd? (C+1). 
Hence we obtain a limit Q as in Proposition 22.4.3 and a function w with 
x" D? wel?, |x| +|B|<3, for which (22.4.11) is not valid. Approximation of w 
with a C? function gives a contradiction which proves the lemma. 


When we use (22.4.21) it is convenient to change scales by substituting 
ylél? for y and then y—x for y, D—é for D (which amounts to a translation 
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of w or #). This gives the equivalent estimate 
(22.4.21y [iwl?dy 
S(C+ Dit » Play 6) (y — x)? (D — €)* e/a! B+ Dolx, C) Wi? dy 
at pl] <2 


+e(Z)f Sgt (y—x) (D—Owirdy, weX(R"), xeK. 


lat+Bls3 


Lemma 22.4.13. Assume that the hypotheses in Lemma 22.4.12 are fulfilled, 
and let Pi, P, be properly supported operators with principal symbols dp,/0¢, 
and \é|~* Op, /ax; respectively. Then there is for every compact set Kc X a 
constant C, such that for arbitrary 6>0 and ueCP(K) 


(22.4.23) lulls Cx | Pull(g) +9 (ieee) [Pal + DIP ula +C; lull 4)- 
1 1 


Proof. If yeC$(X) is equal to 1 in a neighborhood K, of K then Pu=Pyu 


= p(x, D)u, ueCP(K), where PES ne(R") has compact support in x and p is 


equal to the symbol of PmodS~® in K,. We choose ¥,EC? equal to 1 in 
K and with support in the interior of K,. The estimate (22.4.21) is valid for 
some e(€) 0, €- 00, for all xe Ky. 

We shall now localize with respect to the metric 


Bs=ldxP(O [E+ 1) +6467 (5718+), O0<d<1. 


The quotient by the dual metric is 5+; since the metric is slowly varying and 
o temperate when 6=1 it follows that this is true uniformly when 
0<6<1. Choose wy, C3 (IR?") and (x;,¢,)esupp w, so that y, is real valued, 


a fa 


y; is uniformly bounded in S(1,g;) and there is a fixed bound for the 
number of overlapping supports and for the g, distance from (x;,¢,) to 
points in suppy,. Then {y,} is a symbol in S(1,g,) taking its values in 7? = 
£(C,¢?)= L(¢*, ©), so the calculus gives 


d ly jx, Dull? =(u,u)+(r(%, D)u, u) 
where r has a uniform bound in S(é”, g,). It follows that 
(22.4.24) [ul|?<¥ |x, D)ull?+ Cd? jul’, ueCe. 


The proof of (22.4.23) will be complete if we show that the sum in the right- 
hand side can be estimated by the right-hand side of (22.4.23). 
If suppy, and supp x, are disjoint then Theorem 18.5.4 gives 


W(x, D)u=w (x, D) x, u= (x, D)u, ue Co (K) 
where {¢,} is bounded in S(67, g,). Hence 
(22.4.25) Y' ly jG D) ull? <Cd? ul?, we Co(K) 
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if $’ denotes the sum for such indices. For any finite sum we have 
(2.4.25) Di” liv x, Dull? SC lull). 


From now on we therefore assume that x,eK, and that |¢{ is larger than 
some suitable number depending on 6. Then we have by (22.4.21) 


(22.4.26) |W j(x, D) ull? S$(C + 1) | Tx, D) W(x, D) ul? 
as A Dy, Measl Dull’ 


where 
T(x, Dir d PY (x, E)(x—x,)? (D—¢,)*/a bp! + polx;, an) 
a+pls2 
Vapi O= WE — x) UE ID, + E — EN Wi, . 
Now (|é j* Dd, y, is for large j bounded in S(5'*!, g,) and the components of 
| |?(x~x,) and |€|-#(€—€,) are bounded in S(5~*,g,) in supp;, so Wap; is 
in a bounded set in S(6~**4l, g,). Hence 
(2.4.27) YY cE) lWapilx D) ull? =(Ru, u)S CO? ull? 
Ja+B|S3 j>J 
since the symbol of R is bounded in S$(67,¢;) if J is large enough. 
To estimate the first term in the right-hand side of (22.4.26) we observe 
first that the symbol of T;(x, D) p ,(x, D) is 


Y T(x, 2 Di lx, E)/a! =R x, 2) +S (x, 8, 


Ja} s2 
Rx, ¢) = » (T(x, €) — p(x, €)) DE w(x, S/a!, 
ja) $1 
S (x, = de pe é) dD. W (x, eyo ! i 2 pP(x;, i) W jay C)/a te 
als la{= 2 


Here T,(x,é)—p(x,¢) is bounded in S(6~7(1+67|é|)~*,g,) in suppw, by 
Taylor’s formula, for the norm of the k'" differential of p with respect to g, 
is bounded by (1 +|€|) ((67|é| + 1)/64(1 +1 é|)?)"? C,. Hence {R;} is bounded in 
S(6-7(1 + 67|E|)~4, g,) so it follows that 


(22.4.28) Y Rul]? SCd-* (14-67 {DIF ull? SC, |lull?_s). 
Finally 
S (x, D) = (x, D) p(x, D) +i W)(% D) Pyy(x, D) 
1 YW je) D) p(x, D) + Vix, D) 


where {V;} is bounded in S(6?+(1+067|&|)~*,g,). In fact, in the asymptotic 
expansion of the symbol of say w,(x,D)p(x,D) the terms improve suc- 
cessively by at least a factor (14+ 67E))#(1+|é)) “+, and (1+67/E) (14+ |&)“* 
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$6? +1/1+67|é), so the error when one breaks off after two terms 
has this weight. The last term in S, has a similar bound. It follows that 


¥ V(x, D) ull? S$ C54 ull? + C,lull2_,)- 


If we note that y and w,,) are bounded in S(57(67|é/+1)~*,g,) and in 
S((67|E| + 1)3, g,) respectively, we conclude that 


(22.4.29) }) |S,ul}? < C(ilp(x, D) ull? +5? Y lp (x, D) ull%, 
+°¥ Poy D)ull?_)+5* lull?)+ Cs lull? _ 4). 


This estimate combined with (22.4.28), (22.4.27), (22.4.26), (22.4.25) and 
(22.4.24) yields (22.4.23). 


From (22.4.1) it follows that Rep, 20 and that yRep, —Imp, 20, hence 
(2.4.30) 11 PPG, OI? +1817 Y brig VPS Cx Repilx,2), xeK, 
by Lemma 7.7.2, as already observed. Theorem 18.1.14 gives 
D p(x, Dy ull2+¥ bpp, D) ull? a 
SC, Re(p(x, D)u,u)+ Cx |lull?, we C2(K). 

This means that with new constants 

DV UPrully td WP uly S Cx(lPul + lull), 
which combined with (22.4.23) gives for small 6 

lulloy SCx(IPull + lull,_5), we Co(K). 


The logarithmic convexity of |ulj,,, in s shows that C,x|lu\|(_,, can be 
estimated by |jul|(o/2+ Cx |lull;_y), 80 we can replace |[ull,_,) by |lwil(_,) in 
the right hand side. Summing up, we have 


(22.4.31) lull +d IPiully td WPullay 
SCx(Pullot lula), we C3(K). 


This is a perfect substitute for (22.2.9), (22.2.10) with e=4 although there is a 
slight discrepancy in the subscripts caused by the fact that P is now a first 
order operator. However, nothing has to be changed in proving that 
(22.4.31) implies that P is microhypoelliptic with loss of one derivative. 
Summing up the results of this section we have therefore proved 


Theorem 22.4.14. Let Pe Pong(X) be properly supported and assume that the 


principal symbol satisfies (22.4.1). If for some se we have 
ue€ (X), Puc Hue As m—1) 
then there is for every compact set K<X a constant 6>0 such that 


5S |py 100°, 6°) + OG, +) Qa,+u, OSa,eZ, 
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if x°eK, |E°|=1, O(y, ¥o,9,M0) is any limit of 
VO Pm(X + VIEl~*/Yo, E+ mlEl*/yo) [El -™ 


asx — x°, & + oo, €/|€| > €°. The numbers 1; are the eigenvalues in T of the 
Hamilton map of Q/i, and v is in the space spanned by its generalized 
eigenvectors belonging to eigenvalues < 6 in absolute value. Conversely, if this 
condition is fulfilled then 


ue Q(X), Puc Hs) at (x°,£°) implies u € Hosm—1) at (x°, €°) 
for arbitrary s € R and (x°, €°) € T*(X)\ 0, so P is microhypoelliptic. 


The condition in the theorem is deplorably complicated, but. it simplifies 
if one makes stronger hypotheses on the geometry: 


Theorem 22.4.15. Assume in addition to (22.4.1) that the zeros of p,, form a 
C® manifold where p,, is transversally elliptic. Then the conditions in Theorem 
22.4.14 are equivalent to 


PS _1(x°, £9) + OG, v) + HQq+1)yj #0, OS a EZ, [é|=1 
if Pm(x°, €°) = 0, Q is the Hessian of Pm/2 at (x°,£°), pj are the eigenvalues in 
I of the Hamilton map of Q/i, and v is a generalized eigenvector belonging to 
the eigenvalue 0. 


Proof. By a remark after the proof of Proposition 22.4.3 we only have to 
consider the limit Q which occurs here, and Corollary 22.4.11 shows that 
the uniformity of the test estimate (22.4.11) is automatic. 


Notes 


In the 1950’s the results on hypoelliptic operators of constant strength 
proved in Section 13.4 seemed quite general, but Tréves [4] constructed an 
example of a hypoelliptic operator not satisfying these conditions for any 
choice of coordinates. Such examples can now be found in the class of 
hypoelliptic operators discussed in Section 22.1. It was introduced in Hér- 
mander [18] and reformulated in Hérmander [42] which we have followed 
here. For second order differential operators no such class of operators 
is invariant under coordinate changes. This motivated the study in Hor- 
mander [20] of hypoelliptic operators related to the equation of Kolmo- 
gorov [1]. (See Section 7.6 for the original treatment.) Simpler proofs of 
slightly less precise but more general results were later given by Kohn 
{2] and Olejnik-Radkevité [1], and we have followed: their arguments in 
Section 22.2. 

The precise lower bound in Theorem 22.3.3 is due to Melin [1]. It was 
observed in Hérmander [36] that the stronger Theorem 22.3.2 is valid 
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under additional assumptions. The proof of Theorem 22.3.3 given here 
exploits that fact. See also Hérmander [37, 39] and particularly Fefferman- 
Phong [1,2] for strong versions without such supplementary conditions, 
some of which were discussed in Section 18.6 above. 

Melin [1] proved that his inequality implies some cases of Theorem 
22.4.14 due to Radkevié [1]. Sjéstrand [3] went much further, and a great 
deal of the methods in Section 22.4 come from his paper; some special 
model cases had been discussed before by GruSin [2]. The generality was 
increased further in Boutet de Monvel [4], and Theorem 22.4.14 was proved 
in full generality by Hoérmander [35]. An independent proof of Theorem 
22.4.15 was given by Boutet de Monvel-Grigis-Helffer [1]. 

From the study of hypoelliptic operators with double characteristics 
there has developed an extensive study of hypoelliptic operators on nil- 
potent Lie groups. We shall not discuss these interesting developments here 
but refer the reader to Helffer-Nourrigat [1], Melin [2], Rothschild [1] and 
the references given in these papers. 


Chapter XXIII. The Strictly Hyperbolic 
Cauchy Problem 


Summary 


The Cauchy problem for a constant coefficient differential operator P with 
data on a non-characteristic plane is correctly posed for arbitrary lower 
order terms if and only if P is strictly hyperbolic (Corollary 12.4.10 and 
Definition 12.4.11). If the plane is defined by x,=0 this means that the 
principal symbol P,(é’,é,) has m distinct real zeros €, for every ¢' 
=(€,,...,€,_ ,)€IR"~1 0. In Section 23.2 we shall prove that this condition 
also guarantees the correctness of the Cauchy problem in a very strong 
sense when the coefficients are variable. The converse will be discussed in 
Section 23.3 and in the notes. 

The proofs in Section 23.2 depend on factorization of P(x,D) into first 
order factors of the form D,,—a(x, D’) where the principal symbol of a is one 
of the zeros of P,(x,¢’,é,). We shall therefore study such first order oper- 
ators in Section 23.1. The basic tool is the energy integral method. It could 
be applied directly to the operator P(x,D) but it is more transparent and 
elementary in the first order case which will therefore be studied first. 

In Section 23.3 we show that for operators of principal type the correct- 
ness of the Cauchy problem requires strict hyperbolicity away from: the 
plane carrying Cauchy data. However, certain types of double roots may 
occur at that plane. This situation is studied at some length in Section 23.4 
since it occurs for the important Tricomi equation. We shall also come 
across such operators in Section 24.6. 


23.1. First Order Operators 


In this section we shall study the Cauchy problem in R"*' 
(23.1.1) du/dt+a(t,x,D)u=f, O<t<T; u=qd when t=0. 
We shall assume 


(i) a(x, €)=a(t,x,&) belongs to a bounded set in S‘(IR"x IR") when 
OstsT; 
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(ii) ta, is continuous with values in Q’(R?") (or equivalently in 
c*(R*"); 

(iti) Rea(t,x, D2 —C, OSt<T. 

The assumption (iii) is natural for if a-is constant and f=0 then the 
solution of (23.1.1) is u=e-% ¢. 

It follows from (i), (ii) that trea(t,x, D)u is continuous with values in ¥ 
if ueS, for a(t,x,D)u is bounded in Y by Theorem 18.1.6 and continuity 
with values in C™ is obvious. In view of Theorem 18.1.13 it follows that 
a,(x,D) is a strongly continuous function of t with values in the set of 
bounded operators from H,,(IR") to H,,_ ,)(IR"), for every s. 

The following basic estimate is proved by the energy integral method 
which we already encountered in the proof of Lemma 17.5.4. 


Lemma 23.1.1. [fs € IR and if  € R is larger than some number depending on 
s, we have for every u € C'((0,T]; Hy) A C%([0, 7); Hs41)) and p € [1,0] 


T 
(23.1.2) G fle“ *utes dIkeyrdey'”? 


T 
S ||uO, Ics) +2 fe |\Ou/at +a(t,x, D)u||sdt, 
0 


with the obvious interpretation as a maximum when p = ov. 


Proof. First assume that s=0. By Theorem 18.1.14 it follows from con- 
ditions (i) and (iii) that for some constant c 


Re(a,(x, D) v, v)2 —c(v,v), veH,). 


Writing f=du/dt+a(t,x,D)u and taking scalar products for fixed t, we 
obtain 


2Re(f(t),u()e"?* =e ?**||u(t)||? + 2Re((a,(x,D) + A)u(o),u(pe?* 


o 
ot 
Ce 


—2at 2 
=a7¢ |u(o)|I 


provided that A=c. If we integrate from 0 to StS T it follows that 
t 
M(t?= sup e~***|ju(z) ||? < ju)? +2M(d) J e~** flac. 
OstSt 0 


Hence 
2 


(m@- fe*yenar) <(ilmon+ fe Wenidr) 
which for 4 =c implies 
e-"uCe] S oO] +2 fer UFC, 
If we multiply by e©-” it follows that 


eM ju(e)|| S > u(Oyl] +2 fe” Ir lead. 
0 
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When \ > 2c we have (c — A) < —A/2, and the norm of e~**/2 in LP(0, 00) is 
Ss (2/d)'/P if A > 0, which gives (23.1.2) when s = 0. 
To prove (23.1.2) for arbitrary s we set E,(D) = (1 +|D|?)"?, 
b(t, x, D)=E,(D)a(t,x,D)E_,(D). 


Since b(t,.)—a(t,.) is bounded in S° the properties (i){iii) remain valid for b. 
The estimate (23.1.2) follows from the same estimate with a replaced by b, s 
replaced by 0, and u replaced by E,(D)u. The proof is complete. 

The following existence and uniqueness theorem is suggested by (23.1.2) 
and it extends the validity of this estimate. 


Theorem 23.1.2. Assume that (i)-(iii) above are fulfilled and let seIR. For every 
fEL'((0,T); Hi) and PEH,, there is then a unique solution ue C([0, T]; H,.) 
of the Cauchy problem (23.1.1), and (23.1.2) remains valid for this solution. 


Proof. Uniqueness. Assume that f=0, @=0. Since a(t, x, D)u is a continuous 
function of t with values in H,,_ ,) it follows that 


ueC'([0, T]; Hy_y)AC(L0, T]; Hy). 


We can therefore apply (23.1.2) which s replaced by s — 1 and conclude that 
u=(, 


Existence. If a solution u of the Cauchy problem (23.1.1) exists and 
ve Colt{(t, x); t<T}) then 


{ (u —dv/dt+a(t, x, D)* yates v) dt +(¢, v(0)). 
0 0 


If we replace t by T—t and a(t,x,D) by a(t,x,D)* it follows from Lemma 
23.1.1 that for some C 


abe lor» 8 sc/ gl = —dv/0t+alt, x, D)* v 
Hence Hi 
ju, v)dt+(p, OSC J el. 4e 
5 
Using the Hahn-Banach theorem to extend the anti-linear form 
i ; 
gtr { (f,v)dt +(¢, v(0)) 
0 
we conclude that there is some ue L”((0, T); H,,) such that 
T T 
J(u, —dv/dt +a(t, x, D)* v)dt =f (f,v)dt +(¢, (0) 
0 0 


if veCy and t<T in suppv. Thus 

du/dt+a(t, x, D)u=f 
in the distribution sense when 0<t<T. Assuming for a moment that fe 
we obtain du/dteL”((0,T); H,,_,)), hence ueC([0, T]; H,,_,,). Using the 
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equation again we find that ueC'([0, T]; H,,_2)) and that u(0)=4¢. If feX 
and ¢e¥ we obtain if s is replaced by s+2 in the argument that there is a 
solution of (23.1.1) to which (23.1.2) is applicable. Now we can for arbitrary 
f and @ satisfying the assumptions in Theorem 23.1.2 choose sequences 
o,eF (R") and fe (IR"**) such that 


Ib Gly 7% FI SO-LOly at. 
Choose u,EC([0, T]; Hg, ) so that u,(0)=, and 
6u,/dt+a(t, x, D)u,=f,. 
Then it follows from (23.1.2) that u, is a Cauchy sequence in C([0, T]; H,,). 


The limit as v--oo is the required solution of (23.1.1). This completes the 
proof. 


Corollary 23.1.3. If u satisfies (23.1.1) with f=0, and pEH,,(IR") for every s, 
then ueC'([0, T], H,,(R")) for every s if this is true for some s. In particular, 
u(t, Je C™(R") for fixed t. 

We shall now study the wave front set of u(t, -) when beH,.. 0) = as \|)H., 
but ¢@ is not smooth. In doing so we strengthen the hypothesis (iii) to 
(iv) aeShn, and the principal symbol is purely imaginary. 

The direction of the time can then be reversed in Theorem 23.1.2. Suppose 


that (x9, ¢,)€T*(R")\ 0X WF(¢). We can then choose q~)'q;€S),, so that 
0 


do(Xo,&o)#0 but q vanishes outside such a small! conic neighborhood 
of (Xo, &) that q(x,D)@eC>P. We shall obtain information on the solution 
u of the Cauchy problem 


(23.1.3) du/dt+a(t,x,D)u=0, u(0,.)=¢, 
if we can find Q(t,x,D) with Q(0,x,D)=q(x,D) so that Q commutes with 
iD,+ a(t, x, D). Indeed, then we have 
(iD, + a(t, x, D))\Qu=0; QueCyp when t=0; 
so it follows from Corollary 23.1.3 that Qu(t,.)eC®. Hence the wave front 
set of u(t,.) is contained in the characteristic set of Q(t, x, D). 
With b; homogeneous of degree 1 —j and by real we have 


a(t, x, H/i~>y b(t, x, &. 
0 


The symbol Q(t,x,@) should be asymptotic to YO,(t, x, ¢) where Q; is 
homogeneous of degree —j. The principal symbol of the commutator 
[iD,+a(t, x, D), Olt, x, D)] is 
{t+bo(t, x, 2), Qolt, x, 2} = pee 
and the term of order —j is equal to 
(0/6t+ H,,)Q;+R; 
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where R, is determined by Qp,...,Q;_,. Thus we first have to solve the 
Cauchy problem 
(23.1.4) (0/6t+H,,)Q,=0; Qo=4 ) when t=0; 
and then solve inhomogeneous Cauchy problems of the same kind for Q,. 
The equation (23.1.4) means that Q,) is constant on the orbits of the 
Hamilton field 
(23.1.5) dx/dt=0bo(t,x, )/0€,  dé/dt= —Obolt, x, €)/0x. 
The uniformity hypothesis (i) guarantees that there is a solution with initial 
values : 
(23.1.6) x=y, &=n when t=0 
in a fixed interval OStStp if |y|=1, and the derivatives of (x, ) with respect 
to (y,7) have fixed bounds then. Because of the homogeneity of by the 
solution exists for all 7+0 when 0<tSty, and x,é are homogeneous of 
degree 0 and 1 respectively with respect to 4. We can pass from t=0 to any 
t<T in at most |T/t,|+1 steps which shows that the solutions exist for 
OstsT. If we write (x,¢)=y,(y,) it follows from (6.4.12) that y, is a 
homogeneous canonical transformation. Integrating for decreasing t we find 
that x, is invertible. The solution of (23.1.4) is now 

Dolt, x, E)=dolxe * (x, &)). 
The later equations 

(0/0t+ A,,)Q;+R,;=0 

can be written in the form 


00 (t, x.(y, 9))/Ot + Rj(x,(y, 0) =0. 


The solution which is equal to q; when j=0 is therefore 


t 
Oj(t, x.(y. ) =4j(, 1) —J R(x. 0) ds. 
0 
We have uniform bounds for all x,¢ derivatives of the functions Q; so 
obtained. If we choose Q(t, x, €)~}' Q(t, x, €) it follows that 
Q(0, x, ¢)—q(x eS” *, [iD + a(t, x, D), Q(t, x, D)]=R(t, x, D) 


where R(t,.) is bounded in S~® and continuous as a function of t with values 
in C°(IR?"). If u satisfies (23.1.3) with ~eH,_.,.)(R") we conclude that 


(iD, +a(t, x, D)) Q(t, x, D)ue C(L0, TJ; Hy) 


for any s, and that Q(t,x,D)ueH,, for any s when t=0. Hence it follows 
from Theorem 23.1.2 that this remains true for any ft. Since Q(t, x, €)+0 
when (x, )=y,(X9, &o), it follows that y,(xo, é))€WF(u(t,.)). The time direc- 
tion can be reversed, so we have proved 
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Theorem 23.1.4. If a satisfies (i)-(iv) and ueC([0,T]; H,_,.)(R")) is the 
solution of the Cauchy problem (23.1.3), with Pe H,_...\(R"), then WF (u(t, .)) 
=y7,WF(d) where y, is the canonical transformation obtained by integrating 
the Hamilton equations (23.1.5) from 0 to t, with by equal to the principal 
symbol of a/i. 


In the preceding discussion we have only considered the singularities of 
u for fixed t. If a € S'GR"*! x IR") we can also discuss the singularities of u as 
a distribution in IR"*?. In fact, although D,+a(t,x, D)/i is not a pseudo- 
differential operator it becomes one if we multiply by a pseudo-differential 
operator in (t,x) with symbol vanishing in a conic neighborhood of €=0 
(Theorem 18.1.35). Multiplying the equation (D,+a(t, x, D\V/i)u=0 by such 
an operator we conclude that when 0<t<T we have 


WF(u)< {(t, x; —by (t, x, é), é)} U {(t, xX, T, 0)}. 


From the proof of Theorem 23.1.4 we also obtain (t,x; 71, 6)¢WF(u) if €+0 
and Q,(t,x,é)+0. If for some reason we know that €+0 when 
(t, x, t, 2) WF (u), then it follows in view of Theorem 8.2.4 that 


WF (u(t, -))={(% €)5 (6, x; — bolt, x, €), Che WF (u)}. 


Thus Theorem 23.1.4 means that WF(u) is a union of bicharacteristics of 
t+b (t,x, €). This is a result of the same form as Theorem 8.3.3’; we shall 
return to it in Chapter XXVI. 


The solution of the Cauchy problem (23.1.3) defines a map 
FE: drult,.) 
such that by Theorem 23.1.4 
WF (F, 6)= x, WF (6). 


The analytic structure of F will be elucidated in Chapter XXV; it is an 
elliptic Fourier integral operator associated with the canonical transfor- 
mation y,. We could also have used the theory of Fourier integral operators 
in this section. However, this would have been less elementary, and the 
discussion above provides a concrete background for the study of Fourier 
integral operators in Chapter XXV. 


23.2. Operators of Higher Order 


We shall begin by extending Lemma 23.1.1 and Theorem 23.1.2 to operators 
of the form 


P =LBG x, D,) Di 


in IR"*?!. We shall assume that 
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(i) P,=1, and PeS”~/(R"** xR") when j<m, 

(ii) P. has a principal symbol p,(t,x,¢) which is homogeneous in € of 
degree m—j; thus P(t, x, €)—(1 —x(€)) p,(t, x, eS" *~/ if yeC (R") is equal 
to 1 in a neighborhood of 0; 

(iii) the zeros t of the principal symbol of P 


P(t, x, t, €)=)' p(t, x, 6) 7! 
0 


are real when €+0, and they are uniformly simple in the sense that for 
some positive constant c 


(23.2.1) lp(t, x,t, 2/At]2clé"—* if p(t, x,t, 2)=0. 
Denote the zeros t of p(t, x,t, €) by A, (t,x, )<A,(t, x, <...<4,(t, x, €). 
They are homogeneous in € of degree 1, and 
[DEDE ,A,(t, x, YS Cagle"! 


for this is true when |é|=1 by the implicit function theorem and (23.2.1). Let 
A,=(1 —x(€)) 4,(t, x, ¢). By commuting the derivatives D! in P to the left and 
using the identity D,=(D,—A,(t, x, D))+ 4,(t, x, D) we can find 

m—1 


Q;= Y Q,,(t, x, D,) Dt 
it) 
with Q,,¢S"~*—* such that 
(23.2.2) P—(D,—A,(t, x, D,)) Q;=R (t,x, Dy). 


It is clear that R;¢S”. The principal symbol p—(t —A,)q; i is independent of t 
so it is equal to 0, which proves that in fact R,; eS” . Later on we shall 
improve the crude factorization (23.2.2) but it suffices for the extension of 
Lemma 23.1.1. 


Lemma 23.2.1. If seR and T>0 we have for every u in ny C3([0, T]; 
Hoim—1—9) With PueL'((0,T); Hy) 


(23.2.3) jue d Diu, Mieam—1—H 
<C,2(5 |DiW(0,JIkeam—1- p+} iPut hed). 


j<m 
Proof. By Theorem 23.1.2 we may apply (23.1.2) to Q,u with a= —iA,. This 
gives for large 4 and p € [1,00] 

T 
(23.2.4) (4 lle“ Que, df 40)"”? 

0 

S QjuCO, Ico) +2 fe (Puce, Mey + Clue, Ilcorm—n) at. 
0 
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If q; is the principal symbol of Q; then 
q(t, xX, T, é) = II (t —A,) 


vi} 
so it follows from Lagrange’s interpolation formula that 
t= Sat, x,t, QA [[G;-A),  k<m. 
ve 
Let 
M,;(, x, ¢) a (1 —x(6)) A ,{t, x, é/[] (A,(t, x; ¢) —A,(t, x, ¢)) 
vrij 


which is in S***~™GR"*+? x IR"), Then we have 


: m—-1 
Diu=>M,,(t,x, D)Q,u+ ¥ R(t, x, D) Diu 
0 
where R,,eS*~/—'. Hence 


x \| Dru(e, eee 1-1» cy. |Q, u(t, Mat || Dru(t, Jlteem-2—~%)* 


k<m 


In the last sum we can drop the term with k = m — 1, for ||Qiu — D”—"ull¢s—1) 
can be estimated by the other terms. Thus 


T T 
= ( i lle“ DE ut, Mleam—1—-yrA42)'”? S COONS lle Qjule, DIR yd ae)!” 
m 0 


T 
+ DY (flle-*Diug,.) 
10 


k<m— 


(aera, ae, dt)'/?), 


Again by (23.1.2) the last sum can be estimated by 


T 
2 y || DF u(, -) Ilie+m—2—-m +4 >, f lew DE** u(t, )Ilgctm—2-n At, 


k<m-1 k<m-10 


so we obtain 


T 
Of le~™ DE ut, MPeam 1a 40"? S CCX [Df UO, -)Ilcorm—1-1 
k<m 0 k<m 


T T 
+f le~** Pu(t, .)|\s) dt + > f lew DF u(t, )\\cerm—1-K) dt). 
0 k<m 0 


When p = 1 and 4 is large, the last sum is less than 1/2C times the left-hand 
side. Hence we obtain, first for p = 1 and then for arbitrary p, with a new constant 
C independent of s, p, and A when 2 is large 


T 
(23.2.4) Xf lew DE ult, Mees —1—0y> 40)? 
k<m 0 


T 
SCY DF uO, Mlerm—1-— + f Pu, Iles) a), 
k<m 0 


which is a much stronger estimate than (23.2.3). 
Next we prove an analogue of Theorem 23.1.2: 
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Theorem 23.2.2. Assume that {i)-(iii) above are fulfilled. For arbitrary 
fEL'((O,T); Hy) and $;€Hie4m—1—p» i<m, there is then a unique solution 


m~1 
ue () Ci((0, 7]; His+m—1-p) Of the Cauchy problem 
0 
(23.2.5) Pu=f if 0<t<T; Diu=$, for j<m if t=0. 


Proof. The uniqueness is an immediate consequence of Lemma 23.2.1 which 
gives u=0 if f=0 and ¢)=...=¢,,_,=0. To prove the existence we note 
that if u is smooth and v is in C?(R"*'), t<T in supp», then 


(23.2.6) f(u, P*v)dt=f(Pu,v)dt—i Y (Dju(0), D'P,,, (t,x, D)* v0). 
0 0 


jtk<m 


Now it follows from (23.2.3) applied to P*, with t replaced by T—t, that if 
g=P*v then 


» DF v(t)il_.-»S cs Iga _m—s at. 


This implies that , fu t)dt—i Sp Disa 1(% DI 00) 


T 
<C (j Uhadt+ Ehsan) Oat 
t¢] 9 


Using the Hahn-Banach theorem to extend the anti-linear form in g on the 
left-hand side we conclude that there is some ue€L”((0,f); Hyg 4.m_—1)) such 
that 


(23.2.6) es v)dt iD ($,, DE Parail x, D)* v0) = Fu, Propat 


if veCg and t<T in suppv. This implies that Pu=/f if we take v with 
0<1t<T in suppv; that the Cauchy data (23.2.5) are assumed will follow by 
comparison with (23.2.6) when we have established enough regularity for wu. 

In view of the estimate (23.2.3) it suffices as in the proof of Theorem 
23.1.2 to prove existence when f and ¢, are in “ From the fact that 
ueL?((0,T); Hi+m-1)) and PueS -we shall deduce that DiueL(Q, t); 
Hoi+m—1—p) if OSjSm (this is in fact true for every j). The proof is a minor 
variation of that of Theorem B.2.9. First note that Theorem B.2.3 remains 
valid if we define A,,,,) as the set of restrictions of elements in Hm,s)(R"*') 
to (0, T) xR". Indeed, if yeC~(IR) is equal to | in (—0o,4) and 0 in (4, 00) 
then Theorem B.2.3 is applicable to yu and also to (1—y)u if t is replaced 
by T—t. Now we know for the solution of Pu= f constructed above that 

DiueHy-_js+m-1-m> JS, 
if k=0. Assuming that this is known for some k<m we obtain 
Dru=Pu— i P(t, x, D,) Dive Hams tis+m—2-b 
i<m 

By the modification of Theorem B.2.3 just discussed it follows by induction 
for decreasing j that Diue Fast —jsem—e-2) 
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which means that k is replaced by k+1 in the hypothesis. Thus it is true for 
k=m which means that 


na! 
DiueH oq _ j,s—1)> 


and implies that DiueC((0, T]; Fg4m—j—3/2) by Theorem B.2.7. This is 
enough to make (23.2.6) valid. If we compare (23.2.6) and (23.2.6) when Dip 
=0 for t=0 and j<m—1, we obtain 


(u(0) — 9, D?~ *v(0))=0, 


hence u(0)=@,. Taking Div=0 for j<m-—2 only we then obtain D,u(0)=¢, 
and so on until all the Cauchy data (23.2.5) are verified. From the unique- 
ness already established it follows that the Cauchy problem (23.2.5) with 
feS and all ¢,ef has a solution ue’) c™-1([0, T]; H,,). Using the equa- 


tion Pu=f repeatedly we obtain ue( | C*( ((0, T]; H,.)< C®. This completes 
the existence proof. 


jgm, 


The main purpose of this section is to study strictly hyperbolic differen- 
tial operators: 


Definition 23.2.3. A differential operator P of order m in the C® manifold 
X, with principal symbol p, is said to be strictly hyperbolic with respect to 
the level surfaces of @¢e€C*(X,R) if p(x, d’(x))+0 and the characteristic 
equation p(x,é+1 ¢'(x))=0 has m different real roots t for every xeX and 
EET *#\ RG'(x). 


The Cauchy problem for a strictly hyperbolic operator always has semi- | 
global solutions: 


Theorem 23.2.4. Let P be a differential operator of order m with C® 
coefficients in the C” manifold X, and let YE X be an open subset. Assume 
that P is strictly hyperbolic with respect to the level surfaces of peC”(X,R), 
and set 


X,={xeX; o(x)>0}, Xo={xeX; b(x)=0}. 


If fi eH ( X) has support in the closure of X, one can then find 
Uue€HS, m—1)(X) with support in the closure of X, such that Pu=f in Y. If 
s20, v is a vector field with vp=1, and feH(X,), WEES m—1—p(Xo)» 
j<m, then there is some ue HS, m_ W(X) such that Pu= fin X,Y and viu 
=, in XgOY when j<m. 


To prove the theorem we shall first study the case where X CR” and 
o(x)=x,. By hypothesis the coefficient of D” is never 0 so we can divide by 
it and assume henceforth that it is equal to 1. Let 1,(x, €)<...<4,,(x, ¢), 
é'=(é,,...,€,_,) be the roots of the equation p(x, ¢’,4)=0, choose ye Cg (X) 
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with O<yS1 and y~1 in Y, and set 
B(x, )=[](E, A(x, €), 
1 


Ajlx, EV=1O) Aj EV + — x) 1E'- 


We can then define P with principal symbol p so that P(x, £)= P(x, &) when 
xeY and Ps é') is independent of x outside a compact subset of X. Then 
the hypotheses of Theorem 23.2.2 are obviously fulfilled with x, playing the 
role of t and (x,,...,X,_,) in the role of x. 


Proposition 23.2.5. Assume that P is strictly hyperbolic with respect to the 
planes x,,=constant in X <R", and let YEX. If fel, jUR",) and s20, and if 
Vs€Hoscem—1 OR" 1), j<m, one can find u€H.m_—1,)(R",) such that Diu 
is a continuous function of x, with values in Hy, 44+m—1—)GR"~*) when x,20, 
if j<m, and ; 


(23.2.7) Pu=f in YOR, Diu=, in YA(R"-!x{0}) if j<m. 


Proof. Choose T>x, for xeY. The hypothesis on f implies that 
fel’ (0, T), H..)(R"” by so it follows from Theorem 23.2.2 that one can 
find ue () Ci([0, 7]; Ho s:4m—1—p) With Pu=f when 0<x,<T and Diu=y, 


j<m 
when x,=0, j<m. Since ueH,,,_ Ls+0 in the slab {x;0<x,<T} and 
PueH,, » there, it follows just as in the proof of Theorem 23.2.2 that 
U€H a, 145,9» Which completes the proof. 


Proposition 23.2.5 gives a local version of the second part of Theorem 
23.2.4; we shall now prove a local version of the first part. 


Proposition 23.2.6. Assume that P is strictly hyperbolic with respect to the 
planes x,=constant in X CIR", and let YEX. If feH,, OR") we can find 
UH sm 1,)0R",) such that Pu=f in Y. Here s (or t) may be +. 


Proof. If s=0 it follows from Theorem 23.2.2 after x, is replaced by x, —1, 
say, that we can find UEH 1,542) With Pu=f when x,<T. As in the proof 
of Theorem 23.2.2 it follows that ueH,,,,1,) by an analogue of Theorem 
B.2.9. When proving the theorem for s<0 we may now assume that it is 
already verified when s is replaced by s+1. By the remark after Theorem 
B.2.4 we can write 


f=fotD, tn 


where f,€H,,, 1,11) and fen ea, y: Choose ug€ Hy, 4m,¢—1) and u,eH 
such that 


(s+m,5 
Pu,=fy and Pu,=f,. 
Then U=uyt DU, A m1, and 
Pu-f=[BD,]} UH e 1,¢-4)2 
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since [P,D,] is of order m but only of order m—1 in D,. By the inductive 
hypothesis we can find Ls : ee 1) such that Po=[P, D, nlu,, and u=U—v 
is then the desired solution. 


To prove Theorem 23.2.4 by piecing local solutions together we need a 
uniqueness theorem: 


Theorem 23.2.7. If P is strictly hyperbolic in X with respect to the level sur- 
faces of $, then every x eX has a fundamental system of neighborhoods V 
such that ue@'(V), 62 O{x,) in suppu, and Pu=OQ in V implies u=0 in V. 


Proof. We can take a local coordinate system at x, such that x,=0 and 
$(x)=$(0) +x, —|x'|?. Set 


V, = {x; |x, | <8”, [x’|<e}. 


Since p(0, ¢’, €,) is of degree m with respect to €, and has m different real 
zeros if €’eIR"~'\0, the same is true for p(x, &,é,) if xeV,, O<e<ép, for 
p(x, €) is essentially real (cf. Lemma 8.7.3). If geC?(V,) it follows from Prop- 
osition 23.2.6 applied to P* that we can find veC”(V,) such that P*v=g in 
V, and v(x)=0 if x,>e?—6 with 6 so small that g(x)=0 when x,><«7—6. 
But then we have 

0=(Pu, v) =(u, P* v) =(u, g) 


so u=0 in V,. Thus the neighborhoods V, have the desired property. 


Proof of Theorem 23.2.4. We can find a finite number of coordinate patches 
X,cX and open sets ¥,EX, such that Yc|)Y,. To prove the first state- 
ment in Theorem 23.2.4 we use Proposition 23.2.6 to find for every v some 
Ue HS m—1)(Xy) vanishing when @<O such that Pu,=f in Y,. Choose a 
covering of X,Y by open sets V, such that Pv=0 in V, and v=0 when 
@<0 implies v=0 in V,. We choose V, so small that for all y, py’ with 
VV, +9 we have V,UV,<Y, for some v. Now define u=u, in V, if V, 
<Y,. By Theorem 23.2.7 we have u,=u, in V, if V,.cY,OY,., so the choice 
of v does not matter. The definition made in V,, agrees with that in V,, in the 
overlap, since V, and V,, are contained in Y, for the same v if the intersec- 
tion is non-empty. It is clear that Pu=f in V=(JV,,. 

Choose yeC?(V) equal to 1 in a neighborhood W of X,Y. Then f 
= P(yu) in W. Hence we can find geH (xX) and ¢>0 such that g=0 when 
o(x)<e and f=P(yu)+g in Y. It Bilices therefore to prove the statement 
with f replaced by g and ¢ replaced by ¢—«. A moment’s reflection on the 
geometrical nature of the proof of Theorem 23.2.7 shows that the same ¢ 
can be used when ¢ is replaced by @—t for some t>0. Since the statement 
is trivial when @<0O in Y, it follows after a finite number of iterations of the 
preceding local solution. To prove the second statement in Theorem 23.2.4 
we just start by using Proposition 23.2.5 instead of Proposition 23.2.6. The 
construction then continues as before. The proof is complete. 
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We shall next discuss the singularities of solutions of the Cauchy prob- 
lem (23.2.5). As a preparation for the proofs we refine the factorization 
(23.2.2). 


Lemma 23.2.8. If P satisfies the conditions (i)1ii1) at the beginning of the 
section then the operators A, and Q, can be modified with terms of order 0 
and m—2-—k so that the right-hand side R, in (23.2.2) is in S ~©R"*! x IR”). 


Proof. Suppose that we already have (23.2.2) with R,;eS"~" for some p>0, 
as we do for u=1. We can choose AeS1~4(IR"*! x R") so that 


R,(t,x, Q+ACL, x, “yy Onlt,x, 2) Alt, x, SFes"—"-}, 
0 


for R,;eS"~" by hypothesis and the sum is elliptic of order m—1 by (23.2.1) 
since the principal symbol is p‘(t, x, A;,¢). Now it follows as in our earlier 
discussion of (23.2.2) that we can choose q,¢S"~'~*—* so that with the 
notation A,=A;+A the symbol of 


m—1 m—2 


R,{t, x, D,)+ A(t, x, Dy) Y Q y(t, x, D,) Dk —(D, —A,(t, x, D,)) oS q,(t, x, D,,) Dt 
0 i) 


is independent of 7 and belongs to S"~#—!. If A; is replaced by Aj and Q jk 
is replaced by Q,,+9, when k<m-—1, we obtain a new identity of the form 
(23.2.2), now with R,eS"-"—'. If we repeat the argument for y=1,2,... and 
add to A;, Qj the asymptotic sums of the successive corrections 4“, qi (see 
Proposition 18.1.3), we obtain (23.2.2) with Rj € S~©. 


We shall now discuss the singularities of the solution of the Cauchy 
problem (23.2.5), starting with an interior regularity result similar to Theo- 
rem 8.3.3’. It is no restriction that we assume the principal symbol of the 
hyperbolic operator real. 


Theorem 23.2.9. Let P be a differential operator of order m with C® 
coefficients in the C® manifold X, and assume that P is strictly hyperbolic 
with respect to the level surfaces of 6€C*(X,R). If Pu=f it follows that 
WF (u)\ WF (f) is a subset of p~'(0) which is invariant under the flow defined 
by the Hamilton vector field of the principal symbol p. 


Proof. That WF(u)\ WF(f)<p~'(0) is already a consequence of Theorem 
8.3.1. Since WF(u) is closed we just have to show that the intersection with 
any bicharacteristic arc y of p is open in the complement of WF(f). This is a 
purely local statement so we may assume that X CR’, that ¢(x)=x,, and 
that y,>=(0,€,)eWF(u)~ WF(f); we must prove that WF(u) contains a 
neighborhood of y, on the bicharacteristic y of p through y. Since (x, N) is 
non-characteristic if xeX and N=(0,...,0,1), we have €,+0. We can there- 
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fore choose aéS°(IR" x IR") equal to 1 in a conic neighborhood of y, and 
equal to 0 at infinity in a conic neighborhood of X x RN as well as for all x 
outside a compact subset of X. Set v=a(x, D)u, g=P(x, D)v. Then veé’(X), 
YoEWF(v)\ WF(g), and 


(23.2.8) (R" x RN) AWE (v)=96. 


We must prove that this implies that WF(v) contains a neighborhood of yy 
on y. 

First note that if ReS-°(IR" x IR"-') then R(x, D')veC®. In fact, we can 
write 

R(x, D')v=R(x, D’)q(D) v+ R(x, D’)(I—q(D)) v 

where q(£)<S° has support in such a small conic neighborhood of N that 
q(D)veC®, hence R(x, D’)q(D)veC®, but g=1 at infinity in another conic 
neighborhood. By Theorem 18.1.35 we have R(x, D’)(I —q(D))EOpS~ ©, hence 
R(x, D‘\(I —q(D)) ve C®. In particular, we have R(x, D'}veC® if R=0 ina 
neighborhood of supp v. 

Choose P as in the proof of Proposition 23.2.5. Then we obtain Pv 
—PveC®. Using Lemma 23.2.8 we can factor P 


P(x, D)=(D, ~A,(x, D’)) 6 (x, D) + R(x, D’) 


where D, -A, is characteristic at yo but oF is non-characteristic there; 
R, eS~°, "Set we 6; (x, D)v. In view of Theorem 18.1.35 we have y»eWF(w), 
and since R, veC™” as proved above, we obtain y,¢WF((D, —A, j(, D’)) w). By 
the discussion after Theorem 23.1.4 it follows that WF(w) contains a neigh- 
borhood y, of y) on y. Using Theorem 18.1.35 again we conclude that y, 
C WF(v) C WF(u), which completes the proof. 


Remark. The function ¢ plays no role at all in the conclusion of Theorem 
23.2.9. This suggests that hyperbolicity is not essential, and we shall prove 
later (Theorem 26.1.1) that it suffices to assume that the principal symbol is 
real. 


We shall now discuss the regularity of the solution of the Cauchy prob- 
lem, assuming that P is a differential operator of order m with C® coef- 
ficients in the C® manifold X which is strictly hyperbolic with respect to 
the level surfaces of 6€C”(X,R). The sets X, and X, are defined as in 

’ Theorem 23.2.4 and we choose a C® vector field v with vg/=1. Assume that 
feN(X ,) (Definition 18.3.30), and assume that ueQ(X ,), Pu=f in X,. 
Then it follows from Corollary 18.3.31 that u is the restriction to X, of a 
unique element ie ./(X ,), and so is v/u. This implies that the boundary 
value , of viu is well defined. By Theorem 18.3.27 (ii) we have WE,(f) 
< WF,(u), and Theorem 18.3.27 (iv) gives WF(o ,) < WF,(W)OT*(X 9). 


Theorem 23.2.10. Under the preceding hypotheses we have 


- fm—-1 
(23.29) WEstilxy= WENixo (U. WEG): 
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hence WF(u) is contained in the union of the forward bicharacteristics starting 
at WF(f\x,) or at points in the inverse image of the right-hand side of 
(23.2.9) under the natural map 


T*(X)lx, > T*(X 0). 


Proof. The last statement follows from the first if we note that WF, is closed 
and apply Theorem 23.2.9. Thus we only have to verify that the left-hand 
side of (23.2.9) is contained in the right-hand side. This is a local statement. 
After multiplying u by a suitable cutoff function we may therefore assume 
that X CR", o(x)=x,, v=0/dx,, ie&'(X ,) and that (x, )eWF(u), x, >0 im- 
plies €’+0. (See the proof of Theorem 18.3.32.) 

Define P(x,D) as in the proof of Proposition 23.2.5 and form the factor- 
izations 7 . 7 . 

(D,, — A(x, D’)) Q (x, D) = P(x, D) + R (x, D’) 
where R,eS-*(IR"xIR"~') as in the proof of Theorem 23.2.9. Let 
yeT*(X,)~0, and assume that y is not contained in the right-hand side of 
(23.2.9). We must show that y is not in WF,(u)|,,. From the proof of Theo- 
rem 23.1.4 we know that ®,(x, é’)eS° can be chosen non-characteristic at y 
so that 
[D,, —A,( (x, D'), © (x, D')] = T;(x, D’) 


where T; is of order —oo and 9, is of order —0o outside any given neigh- 
borhood of the bicharacteristic of Ek j(x, 6’) passing through y. Now we 
have outside X, 


(D, —A,(x, D’) ® (x, D)O(x, D)u ~ 
=T,(x, D) 6,(x, D) ut+ B(x, D')f+(x, DR (x, Du 


where f= P(x, D)u is equal to f in a neighborhood of suppu, outside Xo. 

All terms on the right are in Y by Theorem 18.3.32. Since @,(x, D’ )R; (x, D’) 
and T,(x, D' )0; (x, D)u are of order — co it follows that the right- -hand side i is in 
Cc for OSx,<é, say, if the support of @,(x',0,¢') is in a sufficiently small 
conic neighborhood of y. The boundary ‘values of 0; (x,D)u are given by 
pseudo-differential operators acting on ¢9,...,$,_ 1 80 the boundary values 
of (x, D’ )0; (x, D)u are in C® if the support of ®, is in a small conic neigh- 
borhood of the bicharacteristic through » . Hence it follows from Theorem 
23.1.4 and the remarks after its proof that ® (x, as D)u is in C® for 


0<x,<e. Now the operators 0 (x, D)= > O ,,(x,D’) Dk, j=1,...,m can be 


regarded as a Douglis-Nirenberg even aeaey. with the operator 
(1,D,,...,D™~1) (see Section 19.5). Hence we can find pseudo-differential 
operators W, and V, such that 


m-1 
W,(x, D') ® (x, D') 6 (x, D)u= : V,(x, D’) Dk u 


+-Ms 
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where V,(x’,0,é)) is non-characteristic at y but V.eS~® for k+0. Hence 
V(x, D)ueC® for small x,20, and it follows from Theorem 18.3.32 that 
yéWFE,(u). The proof is complete. 


23.3. Necessary Conditions for Correctness 
of the Cauchy Problem 


Definition 23.2.3 was motivated only by the constant coefficient theory de- 
veloped in Chapter XII. One might therefore suspect that it is too restric- 
tive. However, we shall now prove that a weaker hyperbolicity condition is 
necessary for an existence theorem like Theorem 23.2.4 to hold. 


Theorem 23.3.1. Let P be a differential operator of order m with C® coef- 
ficients in the C® manifold X, and let YEX be an open subset. Let 
peC°(X,R), 6'+0 in X, and set 


X,={xeX; d{x)>0}, Xy={xeX; o(x)=0}. 


Assume that for every feC2(X) with supp f<X , one can find ueQ'(X) with 
suppucX, and Pu=f in Y. If yeXynY and £€T,*(X) it follows then that 
the characteristic equation p(y, &+1 '(y))=0 in t has only real roots unless it 
is fulfilled for every t. 


Proof. The hypotheses remain valid if we shrink X and Y, so there is no 
restriction in assuming that X CR", ¢(x)=x,, and y=0. Repeating an argu- 
ment in the proof of Theorem 10.6.6 we shall prove that if KEY is a com- 
pact neighborhood of 0 then there exists an integer N such that for some C 
(23.3.1) K(folsc » sup|D*f| >° sup |D* P* v|; 

lal s [BISN Xn>0 


seer, ) veCe (kK). 


Here (f,v)=| fodx and P* is the adjoint of P with respect to this scalar 
product. To prove (23.3.1) we consider (f,v) as a bilinear form on all feC”? 
with bounded derivatives and supp fcR", while veC#(K). Such functions f 
form a Fréchet space with the topology defined by the semi-norms 
sup|D*f|, and the functions v form a metrizable space with the semi-norms 
sup |D* P* v|. When f=Pu in Y we have 


Xn>O 
(f, 0) =(u, P* v). 


If x,20 in suppu this proves the continuity with respect to v for fixed f, in 
view of Theorem 2.3.10. Thus (f,v) is separately continuous, hence con- 
tinuous, which proves (23.3.1). 

Assume now that for some ¢’=(€,,...,€,_,)€IR"~* the equation 


p(0, 6,1) = 
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has a root with Imt+0 but is not identically valid. Then €'+0 and replac- 
ing (€,t) by —(&',1)/Imt we may assume that Imt=—1. Taking <x’, é 
+x, Ret as new x,_, coordinate while x, is unchanged we may also as- 
sume that t= —i and €’=(0,...,0,1). Thus 


(23.3.2) POO, .--50, 0, 15S =(Cn FES, 1 OS 15 Sy) 


where r is the multiplicity of the root —i, thus Q(1, —)+0. 

To show that (23.3.1) cannot be valid we shall choose f and v con- 
centrated close to 0. With an integer v>r we therefore introduce new coor- 
dinates by 

yj=xjp", jSn—2;  yj=xjp?, j=n—1,n, 


where p is large. This blows up any neighborhood of 0 so that it contains 
any given compact set when p is large. P* is replaced 


P*(y, Dy) = P*(y1/p", «+5 Yn—2/P”s Yn—1/P 7s Val Ps 
p’D,,...5p"D,_2,p°"D,_ 1,7” D,). 
From (23.3.1) it follows with k=v(n+2+4N) that 
(23.3.1) (ZolsSCp* ¥ sup|D*f| ¥ sup |D? P* vl, 


lal SN \BISN ¥n20 
when fe C*(R") and veC?j(M) where M is a fixed compact set. As p00 
we have 
p 7" P*(y, Dy) — p(O,. o ,0, Dn—-1, Dn) = O(p”) 


where O(p~’) means that the product by p” is a differential operator with 
coefficients bounded in C® as poo. In the main term given by (23.3.2) the 
factor D,—iD,_,= —(0/dy,_,+i0/dy,) is essentially the Cauchy-Riemann 
operator in y,_,+iy,. Choose an analytic function W(y,_,+iy,) which van- 
ishes at 0 and has positive imaginary part when y,20 and 0<|yi<1. We 
can take 

w= (yn ti y,)+iVn_1 +iy,)?- 


If we put 
n—2 


1 


it follows then that (D,—iD,_,) ¥(y)=0 and that Im ¥(y)=|y|? when |y|/<1 
and y, 20. 
Now we shall construct a formal series 


u,(Y) anes vy)p! 


where v,EC9, vo(0)=1, such that P*u,=O in a neighborhood of 0. Since 
v>r the lowest term in P*u, is p?"’*™"~" times a(D,—iD,_ ,)' 09. Thus we 


require a(D,, =. iD, _ ni Vo = 0, 


402 XXII. The Strictly Hyperbolic Cauchy Problem 


where a=Q(0¥/dy,_,,0¥/dy,)+0 in a neighborhood V, of 0. Choose 
xeECP(Vo) equal to 1 in another neighborhood V, of 0 and set v)»=y. The 
subsequent equations are of the form 


(23.3.3) a(D,—iD,_,)'0;=F, 


where F; is determined by the preceding amplitudes vp,...,v,;_,. Repeated 
solution of Cauchy-Riemann equations gives a solution of (23.3.3) which we 
then cut off by multiplication with y. The equation remains valid in V,, and 
v,E Cg (Vo). 

If we choose for u a partial sum uf of the series u, of higher order, it is 
now clear that the second sum on the right-hand side of (23.3.1)' tends to 0 
as p— oo, like any desired power of 1/p. Choose FEC? (IR",) and set 


f(y) = F(p y) p”. 


Then the right-hand side of (23.3.1)' tends to 0 as p— oo if p is large enough, 
and 


§ £0) u(y) dy > [ FG) ems +0 dy, 


If F is chosen so that the integral on the right-hand side is not 0, it follows 
that (23.3.1) cannot hold. This completes the proof. 


From the constant coefficient case (see Section 12.8) we know that no 
stronger conclusion on the principal symbol at a point in X would be 
possible in Theorem 23.3.1. In particular, X may be characteristic. How- 
ever, in the strictly hyperbolic case it follows from Theorem 23.2.7 that there 
is a finite propagation speed for the support of the solution to the Cauchy 
problem, just as in Theorem 12.5.6. We shall now prove that this is only 
true in the non-characteristic case. To simplify the statement we place our- 
selves in R" right away and write x’=(x,,...,%,_ 1). 


Theorem 23.3.2. Let P be a differential operator of order m with C” coef- 
ficients in X CR", let 0<BSA, and let Y<X be an open subset containing 0. 
Assume that for every feC(X) with x,2A\x'| in suppf one can find 
ueQ'(X) with x, 2 B\x'| in suppu such that Pu=f in Y. Then the plane x,=0 
is non-characteristic at 0 provided that p(0,€) is not identically 0. 


Proof. Let K,={x; x, 2t|x'|}. The hypothesis implies that if KE Y is a com- 
pact neighborhood of 0 then 
(23.3.4) (ZviIsSC ¥ sup|D*f| SY sup|D? P* o| 

lal SN \BISN Ke 


if ve Co(K) and feC°(IR"), supp f<K,. This follows by an obvious modifi- 
cation of the proof of (23.3.1) which we leave as an exercise. If p(0,...,0, 1) 
=0 we can write 


p0,O=Safeyer 
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where q, is homogeneous of degree j, g,=0 and r>0. Now we introduce the 
new coordinates y=x/p” where v>r. We have 


p-™” P*(y,D,)=>q,(D') D7 4+O0(p~”), 


and (23.3.4) is replaced by a similar estimate for P* with a power of p in the 
right-hand side. We choose ¥(y)=iy, now and determine a formal solution 


BOE vy) p72 


of the equation P*u,=0 in a neighborhood of 0. The leading term in the 


equation is s 
: q{D')v,=0 


so we can take v,»ECP(V,) equal to 1 in a neighborhood V, of 0. The sub- 
sequent equations are of the form 


q,(D')vj=F, 


where F; is determined by the preceding amplitudes. They can be solved 
successively in V, with vjeC@(Vo). The proof is now completed just as that of 
Theorem 23.3.1, with Fe C(K 4) of course. 


We can now motivate Definition 23.2.3 of strict hyperbolicity better. First 
recall that by Theorem 10.4.10 the strength of a differential operator with 
constant coefficients is independent of the terms of lower order if and only if 
it is of principal type in the sense that the differential of the principal sym- 
bol is never 0 outside the origin. We shall devote Chapters XXVI and 
XXVII entirely to such operators. The following proposition examines the 
combination of this principal type condition with the necessary condition 
in Theorem 23.3.1 imposed in all of X ,. 


Proposition 23.3.3. Let X be a C® manifold, let @eC°(X) and let 
peC*(T*(X)) be a polynomial along the fibers such that 

(i) dp(x, €)+0 in T*(X) 0; 

(ii) when $(x)20 the equation p(x,€+1 $'(x))=0 in t has only real roots 
unless it is satisfied for every Tt. 
Then it follows that when (x) 2 0 the equation p(x,€ + 7¢'(x)) =0 cannot 
have a root t of finite order k>1 unless k=2, 6(x)=0, and 


(H} b) (x, €+7'(x)) >0. 


Proof. There is nothing to prove unless #’(x)+0. We can then choose local 
coordinates so that #(x)=x,, say. If @(x)>0 then Lemma 8.7.2 with 6 
=(0,...,0,1) shows that if t is a zero of order k of p(x,€é+17¢'(x)) then all 
derivatives of p of order <k vanish at (x,é+7¢'(x)). (Note that the proof of 
Lemma 8.7.2 only requires analyticity in the @ direction.) Hence k=1 by 
condition (i). If ¢(x)=0 we can only conclude that the derivatives of p of 
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order <k with respect to the variables other than x, vanish at (x, €+71 ’(x)). 
If k+1 it follows from (i) that @p/éx,+0. When (s,t) +0 we have 


p(x',s,€,€,+t+t)=ast+bet* + O(s? +|s t+ |el***) 


where a=p,,(x,€+1 ¢'(x)) and b= p(x,E+7'(x))/dE/k!. By condition (ii) 
there must be k small real zeros t for all small s=0. Replacing s by e*s and 
t by et we conclude as in the proof of Lemma 8.7.2 when ¢—0 that the 
equation as+bt*=0 has k real roots t for every s2>0. Thus k=2 and ab<0. 
Now 

H,x,=6p/06,, H;x,= —Op/dx,0 p/dg, at (x, +7 (x), 


Pp 


which completes the proof. 


In Section 23.4 we shall prove that the results of Section 23.2 remain 
valid with very small modifications if strict hyperbolicity is relaxed as in 
Proposition 23.3.3 on the manifold carrying the Cauchy data. However, at 
genuine double characteristics where dP,=0 one can usually prove neces- 
sary conditions on the subprincipal symbol analogous to Corollary 12.4.9. 
There is also an extensive literature on their sufficiency. It is perhaps partic- 
ularly interesting that the Cauchy problem is solvable for arbitrary lower 
order terms for some hyperbolic operators with double characteristics. 
However, it would take us too far to develop these results here, and we 
shall content ourselves with references in the notes. 


23.4. Hyperbolic Operators of Principal Type 


In this section we shall extend the results of Section 23.2 to operators P 
satisfying the conditions in Theorem 23.3.2 and Proposition 23.3.3. If d(x) 
=x, and X is a neighborhood of 0 in R", this means that the principal 
symbol is assumed to satisfy the following conditions: 


(i) pl, C= 2 pl CO) eis Pa 13 


(ii) when xeX, x,>0 and ¢’eIR"~' 0 the equation p(x,é)=0 has only 
simple real roots €,; 

(iii) when xeX, x,=0 and &’eR"~' 0, the equation p(x,é)=0 has only 
real roots, and dp/0x, 07 p/0é2 <0 at the double roots. 


Suppose that ¢,=t is a double root when x’=x), x,=0 and (=. 
Then it follows from the Weierstrass preparation theorem and the homo- 
geneity that in a conic neighborhood of (xo,0,¢') we have 


P(x, 6) =((E, — a(x, 6)? — B(x, 2) a(x, 6) 


where a, beC™ are homogeneous of degree 1, 2 with respect to &’, b(x, &)20 
when x,20, and q is a polynomial in €, of degree m—2. Moreover, b=0, a 
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=t and db/dx,>0 at (x5,0,€5), so b(x,é)2cx,|é?. We shall begin by a 
study parallel to Section 23.1 of such second order operators, which are 
differential operators in x, but pseudo-differential operators in x’. As in 
Lemma 23.2.1 we shall then return to the original operator. 

Before proceeding we give an important example of a differential opera- 
tor satisfying the preceding conditions which it is useful to keep in mind. 


Example 23.4.1. The Tricomi operator 
n~1 
De =x. ¥ DF 
1 


is strictly hyperbolic for x,>0 and satisfies the conditions (i)-(iii) above 
when x,=0. When x, <0 it is elliptic. 


Lemma 23.4.2. Let acS'(IR" x R"~'), beS?(R" x R"~?), ImaeS°(R" x R"~ 4), 
Im beS'(IR" x IR"~'), and assume that 


(23.4.1) XP SC, Rebs V+ CHE), OSX, SL 


Set P=(D,—a(x, D’'))* —b(x, D’). If OSt<TS1 and ueCP(R"), u=D,u=0 
when x, =t, it follows that for every seIR 


FE T 
(23.4.2) AS (lull, a+4D,ul&)er7" dx, S Che? || Pull{, dx,. 

t t 
Here || |, is the H,,. norm in IR"~-' for fixed x,, and C is independent of s 
when A> A,. 


Proof. A change in the lower order terms changes the right-hand side by an 
amount which is O(1/A) times the left-hand side. The statement is therefore 
independent of the lower order terms. It is convenient to choose them so 
that A=a(x, D’) and B=b(x, D’) are self-adjoint. By the sharp Garding in- 
equality (Theorem 18.1.14) we can also assume that b(x’,x,, D’)20 for every 
x,€[0, 1], and we may strengthen (23.4.1) to 


(23.4.1) x, (JE? +DSC, Reb(x,&), OFx,S1. 


As in the proof of Lemma 23.1.1 the independence of lower order terms 
shows that it suffices to prove (23.4.2) when s=0. 

Set w=e"?"/x,, and form with scalar products in L(G), G 
={xelR";t<x,<T}, 


2Im(W(D, — A) u, Pu) =2 Im (W(D, — A) u, (D, — A)? u)—2 Im (W(D, — A) u, Bu). 


With v=(D, —A)u the first term can be written 
2 Im (wv, (D, —A)v)=2 Im (Wp, D,, v) =f wa, |v]? dx 
G 


= ~ Jlol? yf dx+ flo, T)?W(T) dx’. 
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Both terms are non-negative and we shali drop the second one. Next we 
observe that 

~—2Im(p(D, — A)u, Bu) =i([By, (D, — A)] u, u) + (w Bu(., T), u(., T)). 


The last term is non-negative since B is. The symbol of i[Bw, D,—A] is, 
apart from terms of order 1 multiplied by yw, 


—0(by)/ex, + {a, b} p=2Aby + w(b — x, (b/0x,, — {a, b}))/x,. 


If we apply Taylor’s formula to Reb((x, ¢’)—sHz,_ peg), OSSSx,, recalling 
that Re b(x’, 0, &)20 by (23.4.1), it follows that 


Re (b —x,(0b/0x,, —{a, b})) = —C(1 +161?) x7 — Cx, (1 +161). 
Hence (23.4.1)' gives 
Rew (2Ab+ (b —x, (6b/éx,, — {a, b}))/x,) 

Be~*4(1 + [€'17) (24/C, —C)—C(1 + le) v 
When 2>CC, we have 24/C, -C24/C,. By the sharp Garding inequality 
applied to the quotient by w’ for fixed x, it follows that 
(23.4.3) Rei((By, (D,—A)] u,w)—A/C, fe-24 Well 2 ax, 

t 
T 


= —C Jl tell csy ello) 4x, 
t 


We estimate the right-hand side by the inequality between geometric and 
arithmetic means. Summing up, we have now proved 


(23.4.4) Jen?o(x52+2Ale)(D,—A)uP dx +4/2C, fe“ xa a 
t 
S2Jer*™\(D, —A)u||Pul dx/x, 
+C/a JO/%, +x, 7) e724 |ul? dx. 
We estimate the first integral on the right-hand side by the inequality be- 
tween geometric and arithmetic means so that half of the first term on the 
left is cancelled. Now an integration by parts similar to those above gives 
P = JerA rey + Alp +2A/x3 +3/x4) |ul? dx 
<Im — fer x, + 1/x3)(D, —A) utidx 
s1(fe" x, + 1/x;)(D, — A) ul? dx)* 


where we have used the Cauchy-Schwartz inequality. Hence I? can be esti- 
mated by the left-hand side of (23.4.4). We have therefore proved a stronger 
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estimate than (23.4.2) for s=0 


(23.4.5) fe-tn((i3 pe, + L/h) [lull fo) + Allull yt Gey? + 4/x,) ID, — A) ull’) dx, 
‘ T 


SC fe~?*"|[Pull2, dx,. 
t 


(A similar estimate is of course valid when s+0.) The proof is complete. 


We shall also need an estimate for the solution of the Cauchy problem 
for decreasing x, : 


Lemma 23.4.3. With P, t and T as in Lemma 23.4.2 it follows if ueC> (IR") 
and u=D,u=0 when x,=T that 


T 
Can8) JQ? lull, + Axe (lula 1) + 1D, lle) e?* dx, 
t 


T 
\ Chet ex? | Pulidx;, 
t 
when A> A,. 


Proof. We can essentially repeat the proof of Lemma 23.4.2 taking y= 
—x,e*" now. The independence of lower order terms is again clear, so we 
can take s=0 and A,B self-adjoint, B20. Now 


2 Im (v, (D, — A) v)= —J vl? pdx —flv(’, |? ()dx’ 
G 
where —y’=(2Ax,+1)e?**" and the second term is also positive. The next 


partial integration works as before, and the symbol of [iBy,D,—A] is, 
apart from terms of order 1 multiplied by y, 


—2Aby —W (0 +x, (0b/0x, — {a, B}))/%p. 


Using Taylor’s formula as before we find that the real part is bounded from 
below by 


e?*(2AC, — Chaz (1 + |é/|?) — Can(d + [€'))- 


When 2>CC, it follows by the Fefferman-Phong inequality (Theorem 
18.6.8) that 


‘ T 
(23.4.7) f e?(2Ax, + 1)(D, —A) ul? dx +.2/2C, f x2 l\u(., x, 12, e274" dx, 
G t 


T 
S24 \(D, —A)ul |Pul x, e?*"dx+ Ca | |lull%) e?* dx, 
We have : : 


T 
1=2A § e?*|\u]2,,dx $2 Im | e?*a(D, — A)udx 
t G 


S(21/A)*(J\(D, — A) ul? e? dx), 
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hence 
IA S29 \(D, —A) ul? e?*" dx. 
re 


Using Cauchy-Schwarz’ inequality in the right-hand side of (23.4.7) and es- 
timating the left-hand side from below by means of this estimate for I, we 
obtain (23.4.6). The proof is complete. 


Let us now return to the differential operator P of order m in the open 
set X CR", with principal symbol p satisfying conditions (i)-(iii) discussed at 
the beginning of the section. Let YE X. 


Lemma 23.4.4. If O0S$t<TS1 and ueC(Y) it follows that for every seR we 


have 
T m-1 T 
(23.4.8) § ¥ [D™-*-*ull2 dx, SC, flPull2,dx,, 
t 0 t 
if Diu=0 when x,=t and j<m; 
T m-1 T m-2 
(23.4.9) f Det —* alle x2 dx, +f > [Ret alle dx, 
t 0 t 0) 


T 
<C, || PullZ, x? dx, 
t 


if Diu=0 when x,=T and j<m. 


Proof. The proofs are quite similar starting from Lemmas 23.4.2 and 23.4.3 
respectively in cases (23.4.8) and (23.4.9), so we shall only discuss (23.4.8). By 
Lemma 23.2.1 it is sufficient to prove (23.4.8) where t and T are small, so we 
may assume that X and Y are products of open sets X, and Y, in IR"~* and 
intervals I, Iy on the x, axis. We can choose a covering of Y, x (IR"~*\0) 
by finitely many open cones J; such that 


N 
PQ O=[]aes.0s Eel, xeTys 


where each qi is either of the form &,—A(x, €’) or of the form (€, —a(x, €'))” 
—b(x, €’) with a, b real valued, b(x, &’)=cx,|é'|?. The factors qg/ have no zeros 
€, in common. The functions A, a and b are only defined locally at first but 
can be extended to the whole space. For the corresponding operators we 
obtain a priori estimates from Lemma 23.1.1 in the first order case and from 
Lemma 23.4.2 in the second order case. Choose x,;¢C® with support in I; 
and homogeneous of degree 0 in & when |é|>1 so that 


Vx 6)=1 when |é|>1. 
In supp z,; the polynomials 


ET] a(x, 8, x <deg qi, poliasN, 
vEu . 
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form a basis for polynomials of degree m—1 in €,. Using the equation 
Py {x', D')u= x(x’, D')(Pu)+LP, x)’, D’))]u and factoring P on the principal 
symbol level in I;, we obtain essentially as in the proof of Lemma 23.2.1 
after adding the estimates obtained for x,(x’, D’)u 

T m— 


1 
(A-C)J > jpr-*-*ull2 yer? dx, 
a) 


T T m-1 
SC \Pul pet dC A). Se al Cea rd xs 
t t 0 


if Diu=0 when x,=t and j<m. The last sum can be estimated by 


T m-1i 
VS) (DER Fale pepe a, 
t ce) 


In the term with k=0 we write D”™u=Pu+(D"—P)u and observe that 
m—1 
|Dvru —Pull2_ ns Cc > [Dek : GE ays 
0 
The estimate (23.4.6) follows now when 2 is large enough. The remaining 
details are left for the reader to avoid too much repetition of the proof of 
Lemma 23.2.1 (see also the proof of Theorem 28.1.8). 


The preceding lemmas allow us to extend Theorem 23.2.4 and Theorem 
23.2.7 as follows: 


Theorem 23.4.5. Let P be a differential operator of order m with C® 
coefficients in the C® manifold X, and let YEX be an open subset. Assume 
that P is strictly hyperbolic with respect to the level surfaces of p¢C*(X,R) 


In 
X, =(xeX; $()>0}, 


that P remains of principal type in 
X= {xEeX; G(x) =O}, 


and that X, is non-characteristic with respect to P. If f eH (X) has support 
in the closure of X, one can then find UueH eS, m—1)(X) with support in the 
closure of X, so that Pu=f in Y. This determines u uniquely in a neigh- 
borhood of X,Y. 


Proof. We just have to supplement Theorems 23.2.4 and 23.2.7 with local 
existence and uniqueness theorems at X,. We may therefore assume as 
above that X c]R" and that @(x)=x,. Let YE X and choose M so large that 


p(x,¢)+0 whenxeY and [,/2MIe', ¢+0. 
If K is a compact subset of Yn X, then 


K,={x;0S|x,|ST |x’—y'|SM|x,| for some y'eK} 
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is contained in Y if T is small enough. Let feCP and R", nsupp f cK,. If 
0<t<T it follows from the existence Theorem 23.24 and the uniqueness 
Theorem 23.2.7 that there is a solution u,eC>(Y) of the equation Pu,=f for 
t<x,<T such that Diu,=0 for j<m when x,=t. (Heré we use that the 
supporting planes of the cones {x; M(x, —y,)<|x’—y’]} are non-characteris- 
tic in Y) For u, we have the estimate (23.4.6) which is uniform in t when 
t-0. Hence we can find a limit u as t-0 which satisfies the equation Pu=f 
for OS<x,<T and has zero Cauchy data when t=0. It can be estimated by 
(23.4.8) with t=0. Hence a solution ueH,,,_1,, exists for all feH io. with 
support in the interior of K,. The proof of local existence is now completed 
by repeating that of Proposition 23.2.6. 
To prove local uniqueness we argue similarly. Set 


M,={x;0Sx,ST; |x’-—y'|SM(T—x,) for some y'eK}. 


We have M,cY if T is small enough. By Theorems 23.2.4 and 23.2.7 we can 
for any g € C>°(Mr) and t € (0,T) find a solution v of the equation P*v=g 
when t<x,ST with zero Cauchy data for x,=T and suppvcM,. We can 
estimate v by means of (23.4.9) with P replaced by P* and conclude that all 
derivatives are continuous up to the boundary plane X, since P*v=geC®” 
also. If Pu=0 in Y and suppucX, UX4, it follows that 


(u, g)=(Pu, v)=0. | 
Hence u=O in the interior of M,, which proves the uniqueness. 


Remark. There is no difficulty in discussing the Cauchy problem with in- 
homogeneous Cauchy data of the form (23.2.7). However, we must require 
half a derivative more for y, than in Proposition 23.2.5 in order to be able 
to reduce the proof to Theorem 23.4.5. 


We shall now prove an extension of Theorem 23.2.10. The main step is a 
microlocal version of Lemma 23.4.2. 


Lemma 23.4.6. Let a, b, P be as in Lemma 23.4.2, let qeS° be real valued, 
q(x, €')=0 when x,>T, and assume that 


(23.4.10) |6q/Ox'| +|eq/6e"\(1 +16'l) S —60q/0x,. 


Set Q=q(x,D’). If T and 6 are sufficiently small, it follows that for every 
seER and ue S(R") with u=D,u=0 when x,=0 we have when A> A, 


rE: 
oe) AS (Oullé 1+ HOD, ullgeW?" dx, 
0 
T 
SC, f(QPull2, + lPullg_y)e7?*" dx,. 
0 
Proof. From Lemma 23.4.2 we know that 


T T 
AS (lulle4 a +1D, ullg)e7 7 dx, SC f | Pullg_ e774 dx,,. 
0 0 
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For large 4 the lower order terms in P and in Q can therefore only change 
the constant in (23.4.11). If we replace u by (1+|D']’)~‘/?u and observe that 


(1+(DP?P+D'P)%?—P,  (1+ID'PY?QU+|DP)-*? -@ 
are of order 1 and —1 respectively, it is clear that it suffices to prove 
(23.4.11) when s=0. We may also assume that A=a(x, D’)) and B=b(x, D) 
are self-adjoint, that (23.4.1) holds and that B20. We write Q=(q(x, D’) 
+q(x, D’)*)/2. With scalar products in 17(R”,) and p(x)=e7?*"/x, we form 
2 Im (WQ?(D, —A)u, Pu) =2 Im (WQ?», (D, — A) v) —2 Im (WQ?(D,, — A) u, Bu) 
where v=(D, —A)u. Integrating first for x,>t>0 we obtain when t>0 
(23.4.12) 2Im(WQ?»,(D,—A)v)=i((D, — A) v, WO? v) —i(WQ?v,(D, — A) v) 
=(—wW'Qv, Qv)+i(w[Q?, D, —A] », »). 


Since —w’=e—?*"(21/x,+x, 7) the first term gives control of Qv. Next we 
observe that 


(23.4.13) —2 Im(WQ?(D, —A)u, Bu) 
=i(WQ?(D,, — A) u, Bu) —i(Bu, Q7(D, — A) u) 
=i(W[B, Q*] v,u)+i([Q’, D, — A] W Bu, u) 
+i(Q?[wB, D,—A]u, u). 
The terms in (23.4.12) and (23.4.13) containing a commutator with Q? are 
(23.4.14) i(W[Q, D,—A]v, v)+(WLB, Q7] », u) + [Q?, D, — A] Bu, u)). 

With U=(v,(1+|D'|*)*u) this is of the form (/W(x, D')U,U) where W is 
vector valued of order 0 with principal symbol 
Cs f,—4} — {b, q7}/216'| ) 

{b,q73/2|8| {97 €, a} BNE PL 
It is positive semi-definite if 
I{b, q7}/2| =| {b, a}|S{q", €, —a} b* =2q( —0q/0x, — {g, a}) b?, 
that is, 
{b, g}| $2b?( —0q/6x, —{, a). 


This inequality follows from (23.4.10) if Lemma 7.7.2 is applied to b. Thus 
the sharp Garding inequality for systems gives 


T 
Re(WW(x’, D)U, U)Z —C J W(loll_y, + lula) ax, 
0 


T 
2—C/Aj\Pullt_,e7?dx,. 
0 


The last estimate follows from (23.4.5) (with the order of the norms lowered 
by one half). 
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The last term in (23.4.13) can be written 


i(Q*[WB, D,—A]u, u)=i([WB, D, — A] Qu, Qu) +i(Q[Q, [WB, D,—A]] u, u). 


The principal symbol of iQ[Q, [WB,D,—A]] is purely imaginary and of or- 
der 1, hence the self-adjoint part is of order 0 and 


(234.15) |Rei(Q,[Q,[WB, D,—A]] u, Qui SC JW lull dx,. 


Since ull) lull gy lull_4) we obtain using (23.4.5) with the order of the 
norms lowered by one half 


T 
(23.4.16) AAS hh ilull2,, dx, SC fl Pull2_ en? dx,. 
0 


The term Rei([wB, D,—A]Qu, Qu) is estimated from below by (23.4.3) with 
u replaced by Qu. Summing up, we obtain 


T T 
fl) |Q(D, —A) ull? dx, +2/2C, fe OullZ,, dx, 
0 0 
<2 fe? |Q(D, —A) ull |QPull dx,/x,+ CA~* [IPull?_4,e~2**dx, 
0 0 
ci" [Ay xs 2)? e— 24%" Oull? dx,. 
0 


In the first term on the right-hand side we use the Cauchy-Schwarz in- 
equality so that one term which occurs is at most one half of the first term 
on the left. In the half remaining after cancellation we use that Q(D, —A) 
=QD,—AQ+[A,Q]=(D,—4)Q+[Q,D,—A] where the commutators are 
of order 0 so that [A,Q]u and [D, — A,Q]u can be estimated using (23.4.16). 
The last step of the proof of Lemma 23.4.2 is now applicable and completes 
the proof. 


Lemma 23.4.7. If a,b, P are as in Lemma 23.4.2 and | feHo, GR",) then the 
equation Pu=f has a unique solution ueH,, ,_1)<CH,,) when x,<4, such 
that u=D,u=0 when x,=0. If Q satisfies the conditions in Lemma 23.4.6 
with T=! then QueAy, if OFA 0,644) 


Proof. a) Existence. The estimate (23.4.2) extends by continuity to all 
ueH,, .(R".), with t=0 and T=1, so it suffices to prove the existence for f 
in a dense subset of Hi, say all feHyo,,, 1). The equation Pu=f with the 
homogeneous Cauchy boundary conditions means that with L? scalar pro- 
ducts in R”. 


(23.4.17) (u, P*v)=(f,v), veC?(X,), 


where X,={xeR"; x,<1}. If feAio,,,,, then it follows from Lemma 23.4.3 
that 
IF, “SNF IleosesylPlleo, —s— yelfiloss 1» |P* ello, -s—1 
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where the norm on P*v is taken with respect to the half space IR’, Hence 
the Hahn-Banach theorem shows that (23.4.17) is satisfied by some 
ucHo,, +1) QR"). By Theorem B.2.9 it follows that yu has the required prop- 
erties if yeC”(—1, 1) and y=1 in (—4,4). 

b) The estimate (23.4.2) with s replaced by s—i is valid for all 
ueH,, ,_ 1, which gives the uniqueness immediately. 

c) Assume that Of€Ho,,, 4). Choose yeC$(R"~') with [ydx'=1 and let 
f,=2(eD')f be the convolution with e!~"y(x’/s). Then f,€H 0, for every o, 
so the Cauchy problem Pu,=f, with zero Cauchy data when x,=0 has a 
solution when x, <4 satisfying (23.4.11) with s replaced by s+4. Since 


Of,=X(eD') Of +10, HED] f 


and the commutator is bounded in OpS~' for 0<e<1, it follows that Qf, is 
bounded in Hio,,,;) when O0<e<1. Hence (23.4.11) shows that Qu, is 
bounded in Hi;.44). We have u,>u in H(,,, when ¢>0 so it follows that 
QueH,, ,,3), which completes the proof. 


We can now prove an extension of Theorem 23.2.10. 


Theorem 23.4.8. Let P satisfy the hypotheses of Theorem 23.4.5, let fe WN (X,), 
ueD'(X ,), Pu=f in X, and vuly,=,, j<m, where v is a C® vector field 
transversal to X. Then the extension tic V (X ,) of u satisfies 


(23.4.18) WE,(U)I x “WE lxoV (U WF (6). 


Proof. As in the proof of Theorem 23.2.10 we just have to verify that the 
left-hand side of (23.4.18) is contained in the set on the right. To prove this 
local statement we may assume that X CR", $(x)=x,, v=6/0x,, ieé'(X ,), 
and that (x, je WF(u), x,>0, imphes €’+0. At first we also assume that m 
=2 and that P satisfies the hypotheses in Lemma 23.4.2; thus we allow P to 
be pseudo-differential along the boundary. 

Let (y’, 7')eT*(IR"~')\0 belong to the complement of the right-hand side 
of (23.4.18). Choose a conic neighborhood WcIR" x (IR"~!~0) of (y’,0,7) 
such that {(x’,é’);(x’,0,é)eW} does not intersect the right-hand side of 
(23.4.18) and (x, é)eW, x,>0 implies (x, ¢é,¢,)¢WF(S), ¢,eIR. By a slight 
change of the proof of Theorem B.2.9 (see the proof of Lemma 24.4.6) it 
follows from the equation Pu=/ that we can find s so that 


u(x, D'}ueH ys 
for all yeS° with cone suppyc W. Set U=y(x, D’)u. Then 
F=PU=y(x, D’))f +P, x(x, DY] ueH 0,5) 


and 1%, D')o,eCo. If Q satisfies the conditions in Lemma 23.4.6 and y=1 
in a neighborhood of suppg, then QFeH,,,, for all a. since QPy(x, D’) 
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—Qyz(x,D'))P is of order —oo tangentially. Hence Lemma 23.4.7 gives 
QUeH 4) which implies QueH,, ,. 4). 
The condition (23.4.10) is satisfied by 


q(x, €)= We —x,/é — |x! -y'? 161 +1817)? = '/in'I 1?) 


if ¢ is small enough and weC?(—1, 1). The support lies in W when j¢’| is 
large. If we iterate the preceding argument for functions x(x, &) of this form 
only we find inductively that q(x, D'ueH,, for all such g and all s. Using 
partial hypoellipticity again (see Theorem B.2.9 or Lemma 24.4.6) we con- 
clude that q(x, D')ueC®, hence that (y’, 7)¢WF,(u). 

For arbitrary'm we can factor P as in the proof of Theorem 23.2.10 as a 
product of first and second order factors with an error of order —o 
tangentially. Strictly speaking we only have such a factorization in a small 
conic neighborhood of (y’,0,7’), but that is all one needs to apply the pre- 
ceding argument to remove the factors successively. No new ideas are re- 
quired in the proof so we leave the details for the reader. 


Notes 


The Cauchy problem for strictly hyperbolic second order equations was first 
solved by Hadamard [1] with his parametrix method. (See Section 17.4 for 
operators of the form D?—a(x,D) where a is elliptic. The general case is 
similar since as explained in Section C.6 geodesic normal coordinates can be 
introduced in general.) A variant of the method was developed by M. Riesz 
[1]. Another approach depending on L? estimates was introduced by Fried- 
richs and Lewy [1] who transferred the idea of energy estimates from the 
theory of Maxwell’s equations. Thus the estimates were proved by a partial 
integration. However, the extension of the estimates to higher order hyper- 
bolic operators found by Petrowsky [1,5] relied on quite complicated 
Fourier analysis techniques. The simpler energy estimate method was ex- 
tended to higher order operators much later by Leray [1] and Garding [3]. 
Shortly afterwards the Fourier analysis approach of Petrowsky reappeared 
in the guise of singular integral operators. (See e.g. Calderén [2], Mizohata 
{2,3].) We have here steered a middle course, using energy estimates for 
first order operators in Section 23.1 and then a factorization of higher order 
operators into first order pseudo-differential factors in Section 23.2. 

Both in Section 23.1 and in Section 23.2 we study the singularities of the 
solutions of the Cauchy problem closely. Such studies were first made by 
Courant and Lax [1], Lax [3] and Ludwig [1] before the introduction of 
wave front sets; the results were then less precise and harder to prove since 
they did not have a local character. In Chapter XXVI we shall prove more 
general results on interior singularities with a more systematic method. 
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The necessity of weak hyperbolicity proved in Theorem 23.3.1 is due to 
Lax [3] for simple roots and to Mizohata (5] in general. The simple proof 
given here is essentially due to Ivrii and Petkov [1]; see also Hérmander 
[36]. Theorem 23.3.2 also comes from Ivrii and Petkov [1]. These papers 
are mainly concerned with conditions on the subprincipal symbol at a 
doubly characteristic point, that is, one where P,=dP,=0. The Hessian of 
P_/2 is there a hyperbolic quadratic form Q by Theorem 23.3.1 and Lemmas 
8.7.2, 8.7.3. A symplectic classification is therefore given by Theorem 21.5.3. 
If no real eigenvalues exist then solvability of the Cauchy problem for P 
requires that the subprincipal symbol of P lies between —)'y, and )u, 
where ip; are the eigenvalues of the Hamilton map on the positive imag- 
inary axis. The proof was given in part by Ivrii and Petkov [1] and 
completed in Hérmander [36]. On the other hand, when the characteristics 
are at most double and real eigenvalues exist for the Hamilton map at every 
double characteristic (the effectively hyperbolic case) then the Cauchy prob- 
lem can be solved for arbitrary lower order terms (see Ivrii [1], Melrose [6], 
Iwasaki [1], Nishitani [1]). Much work has also been devoted to double 
characteristics which are not effectively hyperbolic. (See Lascar and Lascar 
[1] and the references given there.) However, there are no complete results 
yet in the doubly characteristic case. 


Chapter XXIV. The Mixed Dirichlet-Cauchy 
Problem for Second Order Operators 


Summary 


In Section 23.1 we introduced energy integral estimates for first order oper- 
ators only. The passage to higher order operators in Section 23.2 was made 
by factorization. However, as we saw in Section23.4 the energy integral 
method is also applicable directly in the higher order case, and it was orig- 
inally introduced for second order operators. We shall discuss it in that case 
in Section24.1 to derive estimates for mixed problems. For the sake of 
simplicity only the mixed Dirichlet-Cauchy problem will be discussed in this 
chapter. References to work on more general mixed problems will be given 
in the notes. However, the propagation of singularities at the boundary is a 
very intricate matter also in this special case. It will be discussed in Sections 
24.2 to 24.5 by means of a microlocal version of the energy estimates of 
Section 24.1. Section 24.3 is devoted to the geometrical aspects of the flow of 
singularities. Some problems remain concerning bicharacteristics which are 
tangents of infinite order to the Dirichlet boundary. Thus there is a gap in 
the final results on propagation of singularities in Section 24.5. Two other cases 
where these fail are discussed in Section24.6. Both are related to the 
Tricomi equation. In Section 24.7 finally we discuss estimates for operators 
depending on a parameter in order to complete the study in Section 17.5 of 
the asymptotic properties of the spectral function of the Dirichlet problem. 


24.1. Energy Estimates and Existence Theorems 
in the Hyperbolic Case 


Let P be a second order differential operator with C®™ coefficients in a C” 

manifold X of dimension n, with interior X° and boundary 0X. We assume 

(i) P is strictly hyperbolic with respect to the level surfaces of 
peC™(X,R). 

(See Definition 23.2.3.) Let p be the principal symbol. Since P may be 

multiplied to the left by p(x, @’(x))~! we may asusume without restriction 

that p(x, d’(x))>0 for every xeX. The quadratic form p(x,&) can be polar- 


24.1. Energy Estimates and Existence Theorems in the Hyperbolic Case 417 


ized for fixed x to a symmetric bilinear form p(x,é,y); &neT,*. The 
orthogonal plane of ¢’(x) with respect to this form is then supplementary to 
R(x), and the hyperbolicity means that p(x, &) is negative definite there. 
Thus p(x, €) has Lorentz signature: the indices of inertia are 1, n—1. 

With standard local coordinates x,,...,x,, €,--.,6, we shall write 


P(x, C)= Dig (x)E jo, 


to conform with the usual notation in (pseudo-)Riemannian geometry. We 
identify e€7,* and teT,, when p(x, é,y)=<t,m>, 7éT*, that is, in local coor- 
dinates 


t=) g(x) Ey. 


This carries p over to the dual quadratic form in T,(X) 


Y jl) tty 


where (g,,) is the inverse of (gz), This form defines a pseudo-Riemannian 
geometry in X. A tangent vector ¢ at x is called timelike if D Sjelx)t jt, > 0. 
Since <t, o'(x)> = p(x, &, ¢’(x)) if € is the cotangent vector corresponding to t, 
we must have <t, @’(x)> +0 then so the timelike tangent vectors at x form a 
double cone. Those with <t, 6'(x)> >0 will be called forward directed. Their 
closure is a convex cone in which the pseudo-Riemannian scalar product of 
any two non-zero vectors is positive apart from the case of two linearly 
dependent vectors in the boundary. These boundary vectors are isotropic, 
that is, )’g,,(x)t;t,=0. Vectors with >’ g,,(x)t;t,<0 finally are called space- 
like. Hypersurfaces are classified by their conormals v: A hypersurface is 
spacelike (timelike) if p(v)>0 (p(v) <0). 

We shali assume that 
(ii) The map Xaxt- (x) is proper. 
(iii) OX is timelike. 
Thus X,,={xeX; aso(x)<b} is for arbitrary real a<b compact; the 
boundary consists of the two spacelike surfaces @~'(a) and $~1(b) and the 
part of 0X where ax@<b. It intersects the spacelike boundary pieces 
transversally. The condition (iii) means that the pseudo-Riemannian metric 
remains of Lorentz signature when restricted to 0X. 

One of the main goals of this section is to prove the following existence 
theorem: 


Theorem 24.1.1. Letf € H{S°(X°), uo € H(0¢,(OX) where s 2 0, and assume that 
f and uy vanish when <a. Under the assumptions (i)-(iii) above there is then 
a unique ue HS, ,,(X°) vanishing when ¢<a such that Pu=f in X° and u=uy 
on OX. 


Remark. We have only made a statement here concerning the case of 
vanishing Cauchy data. This is not a serious restriction since we already 
know how to solve the Cauchy problem. However, the discussion of com- 
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patibility conditions between uy, f and the Cauchy data in their common 
domain is otherwise somewhat complicated. 


The proof of Theorem 24.1.1 like that of Theorem 23.2.4 will follow once 
we have proved a local existence theorem analogous to Proposition 23.2.6 
and obtained a local uniqueness theorem analogous to Theorem 23.2.7. We 
shall therefore postpone the proof of Theorem 24.1.1 until we have estab- 
lished these local results. 

Thus we assume in what follows that 


P(x, D)=) g/*(x) D;D, + ¥, b (x) Dj +(x) 


where the coefficients are in C™(IR") and constant outside a compact set, 
(g*) is real with Lorentz signature, and g"">0, g'1=—1. Then we have 
hyperbolicity with respect to the planes t=constant if t(x)=x,, and the 
plane x, =0 is timelike. The energy estimate in Lemma 23.1.1 was obtained 
by taking the scalar product of Pu with a multiplied by e~**. Here we argue 
similarly but multiply with a first order derivative of a First note that if g/* 
and f' are real constants then 


2Re > f' 0:4 ¥ g*O,Qu=>, O)(2 Red. g* Qud~f'9;@) —Y OS fig" Oui 
=D a(T]w)f') 
where for a cotangent vector v and f,=)g¢,if° 


(24.1.1) > Tu) fivj,=2Re > gi*d,uv; > gid, f,, —> g*0jud,7> 2" f.y;. 
When f' and g/* are variable there is only a sesquilinear form in the 


derivatives of u to add on the right-hand side. Since )' g*0;d,u+Pu is of 
first order, we obtain the basic energy identity 


(24.1.2) Ydfle(Y Ti) fit filuPyt aye (Y Tw) f' d,t + ul? Ft) 
= —2Ree*¢ fi’) Pute~* R(u) 


where R is a sesquilinear form in (u,u’) which is independent of A. We recall 
that t=x,, hence t’=(0,...,0,1). We shall integrate the identity over a 
quarter space x, >0, x,<T: The first term can be integrated out by means 
of Green’s formula then, which gives a surface integral over each of the 
boundary planes. The essential point is now to examine when the form 
involving T/ is positive definite; we have kept the notation rather more 
general than necessary to make this discussion clearer. 


Lemma 24.1.2. Let L be a real quadratic form of Lorentz signature in a finite 
dimensional vector space V, and let a, b be timelike vectors in the same cone. 
Thus L(a), L(b) and L(a, b) are positive if L(€,4) is the symmetric bilinear form 
with L(é, €)=L(é). Then the quadratic form 


(24.1.3) Vagr2L(é, a) L(Eé, b)—Lé, 8 L{a,b) 


is positive definite. 
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Proof. The right-hand side of (24.1.3) can be written L(a,y) where 4 
=2EL(E, b)—bL(Eé, ©), thus L(y, n)=L(b, b) L(é, 8) 20. If €=1b+0, L(b, 6)=0, 
then 

L(y, b) = 2 L(é, b)* — L(b, b) L(é, €) =(t? L(b, b) — L(6, 6) L(b, b) > 0 


if +0. Thus y is then in the same closed Lorentz cone as b, which proves 
that L(y, a)>0. 


Remark. Conversely, L or —L has Lorentz signature and a, b are timelike 
vectors in the same Lorentz cone if (24.1.3) is positive definite. In fact, 
taking =a or €=b we obtain L(a,a)L(a,b)>0 and Lib, b) L(a,b)>0. After 
a change of sign if necessary it follows that L(a), L(b), L(a, b) are all positive, 
and when L(é,a)=0 we obtain L(é,é)<0 if €+0, which proves that the 
indices of inertia are 1,n—1. 


If f is timelike and directed forward, it follows that the second term in 
(24.1.2) is positive definite in (u,u’). The surface integral obtained over the 
plane x,=T will also have an integrand which is a positive definite form in 
(u,u’) then. With v=(—1,0,...,0) the first order terms in the integrand of the 
surface integral over the plane x, =0 will be e~* times 


DL Tu) f'vj=2 Re plu, v) p@, f) — pu, 2) pv, f) 


where f, is the cotangent vector corresponding to f If u equals 0 when x, 
=(, then u’=tv so this is equal to [t|? p(v, v) p(v, f,)=|t|*,f* which is positive 
if f points toward the half space where x,>0. In that case we have for 
arbitrary ueC’, if wu’ =tv+é where p(v, )=0 
Y Ti) fiv,=2 Re p(tv+é v) plév+ & f)—pltv+ & ivt+ 2 pv, fD 
=|t\?f'-2Retp(é, fJ+PEOSf? 
2? f7/2-21pE A/S +PEOS. — 
Using a partition of unity it is of course easy to choose f so that f1>0 on 
OX and f lies in the forward cone everywhere; we can take feC™ and 
constant outside a compact set. If we integrate (24.1.2) over Q, 
={x;x,>0,x,<T}, estimate the integral of the first term on the right-hand 


side by Cauchy-Schwarz’ inequality, and cancel a constant times the volume 
integral of |u'|?+|u\?, we obtain if weC2(IR") and A is large enough 


(24.1.4) f Au? +|ulen*dx+ f (uP +Iu\)e7 dS 
Or a0r 


SCA“! f [Pulte**dx+C f (iu? +S leyui?) eras, 
2 


Or x1=0,xn<T 


As a first application we prove a uniqueness theorem. 


Lemma 24.1.3. If D*ueI?(Q,), |o|S1, and Pu=0 in Q, while, u=O in 
{x; x,=0,x,<T}, then u=0 in Q, if'u vanishes outside a compact subset of Or. 
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Proof. Choose a so that a+1<x, when xesuppu. It follows from (24.1.4) 
that 


(24.1.4Y j (\v'|? +|v|*)dx SC, f |Pvl|? dx 
Or Or 


when ve C?(Q,,) and v=0 if x,<a or x,=0 or |x| is large. Let weCr(R"~?), 
Jydx=1, and set 
W,=e' "W(x2/€, ..., X,/€) d(x). 


Since the plane x,=0 is non-characteristic it follows from Theorem B.2.9 
that u is locally in H,,, ; _») for every m with respect to the half space x, >0. 
If x,>0 in suppy and ¢ is small it follows that v,=u*y, is in C~(Q,) and 
equal to 0 when x,<a. The support is compact so we may apply (24.1.4Y. 


Since Pu=0 we have 


Pv, =P(u*y,)—(Pu)* => (gD ,D,u* w,)—(g*D;D,u) * b,) 
+> (b(Dju* p,)— (Du) *W,)+cluxW,)—(cu) *W,. 


In the first sum the term with j=k=1 is equal to 0 since g!!= —1. To the 
others we can apply Friedrichs’ lemma (Lemma 17.1.5), for fixed x, at first, 
to conclude that the I? norm over Q, tends to 0 with ¢. This is obvious for 
the other terms since D*ueL? for |a| <1. Hence the norm of Pv, over Q, 
tends to 0 as e0, and since v,>u in LV? norm it follows from (24.1.4) that 
u=0 in Q,. The proof is complete. 


From Lemma 24.1.3 we can’ deduce a local uniqueness theorem which 
combined with Theorem 23.2.7 implies the uniqueness in Theorem 24.1.1. 


Theorem 24.1.4. Let the hypotheses (i), (iii) of Theorem 24.1.1 be fulfilled in a 
neighborhood of xy¢0X. Then there is a fundamental system of neighborhoods 


V of Xo in X such that ueHG(V 0X°), u=0 in VaX°? when $< (xo), Pu=0 
in VAX°, and u=0 in Vn 0CX implies u=0 in V. 


Proof. We can choose local coordinates at x, such that x,=0, x, =0 on 0X, 
and ${x)=$(0)+x,—|x'?. Let V, be defined as in the proof of Theo- 
rem 23.2.7. If the coefficients of P are extended smoothly to a full neigh- 
borhood of 0 it is clear that P is strictly hyperbolic there. With ye CP (IR"), 
OsyS1 and y=1 in a neighborhood of 0 it is also obvious that 


(x/7) P(x, D) + (1 — x(x/9)) PO, D) 


satisfies the hypotheses of Lemma 24.1.3 if y is chosen small enough. When ¢ 
is so small that y(x/y)=1 in V, it follows from Lemma 24.1.3 that the 
uniqueness statement in the theorem holds. 


Our next goal is to derive an existence theorem from (24.1.4). Thus we 
return to the hypotheses on P made in the proof of (24.1.4). If P* is the 
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formal adjoint of P, then Pu=/f in Q,; and u=u,y when x, =0 implies 


(24.1.5) {(P*vjadx= [ vofdx— J d,viigdS 


Or xX4=0,X%,<T 


provided that veC7(R"), x,<T in suppv, and v=0 when x, =0. (Recall 
that g'!= —1.) Conversely, if (24.1.5) is valid with u, fel? for all such v, we 
first obtain Pu=f by taking veCq(Q;7). Thus ueH,, _,) locally which 
makes the boundary values of u when x, =O well defined. From (24.1.5) with 
general v vanishing for x, =0 it follows that u=u, when x, =0. 

Assuming for example that x,20 in supp f and in suppu, we want to 
find u so that x,20 in suppu and (24.1.5) is valid. In view of the duality in 
Theorem B.2.1 this can be done by means of the Hahn-Banach theorem if 
we can estimate the right-hand side of (24.1.5) by means of the restriction of 
P*v to the half space R”,. 

The operator P* satisfies the same hypotheses as P. Replacing P by P* 
and changing the sign of x,, we obtain from (24.1.4) with T=0 


(24.1.4)” A\ (wl +lol7je*™dx+ f  |d,v|/?e**ds 
Q 


x4=0<x, 


SCA-" | |P*v|?2e**" dx, 
Q 


if veCGdR") and v=0 when x,=0. Here Q={x; x,>0,x,>0} and / is 
large. If x, < T in supp v and T is a small positive number we can take A= 1/T 
and obtain 


(24.1.6) To! (lv'2+lv)dx+ §  |d,v/?dS<CeT § |P*v|? ax. 
Q Q 


x1=O0<Xy, 


This is already an estimate of the desired form but to obtain the right 
smoothness properties of the solution u as stated in Theorem 24.1.1 we must 
first pass to suitable H,,) norms in (24.1.6). Set 


E,(D')=((1+ D2+...+D?_,)t—iD,. 
If w(x,,...,x,€C2(R"-1) then the 12 norm || ||, of E,(D’)w in the half 


space IR"~' ={(x2,...,X,), X,>0} is by Theorem B.2.4 equal to the norm in 
A(R" *) of the restriction of w to that half space. 


Lemma 24.1.5. For every real s we have, if veC>(IR") vanishes when x,=0 
and for x,>T where 0<TST, 


(24.1.7) T~* ! NE, 1D) o(x4,-)I4 dx, + [E(D) 2, 000,.)II4 


= CT f |E,P* v(x,,.)I4 ax. 
0 
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Proof. w=E,v belongs to S(R",.) and w=0 when x,>T. It is therefore clear 
that (24.1.6) can be applied to w. We have 
NE, w(xy,-)IZ S20 +D3+...+Di_ 1)? w(x, JF +2 1D, WI 
=2 (¥ ID,w6xy, IIE + Hoven IID, 
2 
so the left-hand side of (24.1.7) is at most equal to twice the left-hand side of 
(24.1.6) with v replaced by w. Now 
P*w=E,P*v—E,[P*, E_,]w. 
Recall that in the principal part }'g/*D,D, of P* the coefficient g'! is a 


constant. This term drops out in the commutator and in the others we have 
at most one factor D,. It follows that 


E,[P*, E_,]=), R,(x, D') D, + Ro(x, D’) 


where R,...,R, are in S°(R"xIR"~'). The operators R,(x,D’) are sums of 
products of the operators E,, E_, and differential operators, so R,(x,D’)D,w 
=R,(x, D')w, when x,>0 if w,=D,w when x,>0 and w,=0 elsewhere. 
Hence it follows from (24.1.6) with a constant C, that 


(T-'-c,1f (» [DCs )14 + hme IHR) doy + 2, w(0,-)112 
0 1 
<2 CeT f |E,P* v(x,,.)|}2 dx,. 
1) 


When T?C,<4 we obtain the estimate (24.1.7) by the remarks at the 
beginning of the proof. 


By duality we can now prove a local existence theorem: 


Lemma 24.1.6. Let s20 and feH,,(R"), uo¢H,,, QR" *), both with support 
where x,20. Then one can find ueH §, ,(Q7,) (with respect to the half space 
H’={x;x,>0}) such that Pu=f in Q7,, x,20 in supp u, and u=uy on the 
part of the boundary where x,=0. 


Proof. By Theorem B.2.4 we have 
WFQ 2 SNES SC? 4x1, Molise y= NES 140 
0 


where || || denotes the 2 norm. If we apply (24.1.7) with s replaced by 
—s—land T=T,, it follows that for some C 


1 


|Jofdx— Jf a,vadSiSC (i JE_,_.Ptole,.JI4 dx,) 
Or 0 


xy=0,x,<T 
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if veC? and v=0 when x,=0 and when x,>T. By the Hahn-Banach 
theorem we can therefore find a linear form L on C? such that 


00 + 
[Lowy] <C (i JE_,. (1,13 dx,) , wee, 
07 


L(P*v)= f vfdx— J A, vty dS, 


Qr x1 =0,X%,<T 


if ve Cf and v=0 when x, =0 and when x,,>T. The continuity means that 


Co 


L(w)= f (w(x,,-), u(x,,.))dx, 
ce) 
where uel?((0, 00); H G 41)0R"~*)). Thus u defines an element in Aes +1(7’) 
such that x,20 in suppu and Pu=f in Q7, u=u_ when x,=0. Since 
fe (H')cHA,._,(H’) it follows from Theorem B.2.9 that ueHs (7) 
(with respect to the half space H’) which completes the proof of the lemma. 


Proof of Theorem 24.1.1. The proof of Theorem 23.2.4 can be repeated with 
the reference to Theorem 23.2.7 (Proposition 23.2.6) replaced by a reference 
to Theorem 24.1.4 (Lemma 24.1.5). The details are left for the reader to 
repeat. : 


The proof of the crucial estimate (24.1.4) can also be adapted to the 
boundary conditions in Example 12.9.14 b) when all inequalities required for 
E and F there are fulfilled in a strict sense. The main difference is that 
another partial integration is required in the boundary plane x, =0 and that 
the choice of the vector field f requires more attention. We refer to Garding 
[6] for details and shall discuss the higher order case in the notes only. The 
arguments used to exploit the energy estimates here are clearly applicable 
quite generally. 


24.2. Singularities in the Elliptic and Hyperbolic Regions 


From now on we shall consider an arbitrary second order differential 
operator P with real principal symbol p and C® coefficients, defined in a 
C® manifold X of dimension n, with boundary 0X. We assume that 0X is 
non-characteristic with respect to P and write X°=X~ 0X. Our purpose is 
to study the singularities of a solution ue. V(X) of the mixed problem 


(24.2.1) Pu=f in X°, u=uy in OX, 
where fe.W(X). In X° we know that WF(u)\ WF(f) is contained in the 


characteristic set p~1(0). By Theorem 23.2.9 this set is invariant under the 
flow defined by the Hamilton field H, of the principal symbol p, if p is 
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strictly hyperbolic. Since only microlocal properties of p are important, it is 
easy to see that hyperbolicity may be replaced by p,+0. (Compare with the 
proof of Theorem 20.1.14.) We leave the proof for the reader for even this 
condition will be eliminated in Section 26.1. 

The purpose of this section and the following ones is to prove analogous 
results at the boundary. By Corollary C.5.3 we can introduce local coor- 
dinates (x,,...,x,) such that X is defined by x,20 and, perhaps after a 
change of sign, 


(24.2.2) P(x, 6)=E7—1(% 6), C= (E25. Sy). 


Recall that T*(X)|,y is projected on T*(6X) with the conormal bundle as 
kernel if we restrict cotangent vectors of X to cotangent vectors of 0X. The 
set Ex T*(0X) of points such that p+0 in the inverse image is called the 
elliptic set of the mixed problem; in local coordinates 


(24.2.3) E= {(x’, e); r(0, x’, ) < O}. 
By Theorem 20.1.14 it follows from (24.2.1) that 
(24.2.4) En WE,(u)=E A(WE,(f) VU WF(uy)), 


so we have complete control of the singularities in the elliptic set. The 
hyperbolic set H is the set of points in T*(@X) such that p has two different 
zeros in the inverse image, that is, in local coordinates, 


H=({(x',&); r(0,x’, €) > 0}. 
The corresponding zeros of the characteristic equation are defined by x, =0 
and €,=+r(x, &)*. The Hamilton field 
H,=26¢, 0/0x, —0r/d¢' 0/0x' + dr/0x d/0g 
points into X (out of X) for the positive (negative) root. Thus the positive 
root defines the initial point of a bicharacteristic which goes transversally 


from 0X into X°, while the negative root is the end point of a transversally 
incoming bicharacteristic. 


Theorem 24.2.1. If (24.2.1) holds and 
(x', ¢')e(H 0 WE,(u))s (WE,(f) VU WE(uo)) 
then the corresponding outgoing and incoming bicharacteristics are both in 


WF(u) in a neighborhood of (0, x’): 


On the other hand, if (x’,£’) ¢ WF,(u) then these bicharacteristics do not 
meet WF (u) over a neighborhood of (0, x’), since WF,(u) is closed. Hence they 
cannot meet WF (u) until they encounter WF (f) (or return to the boundary). The 
theorem is of course a microlocal form of the reflection law of optics. 


Proof of Theorem 24.2.1. We must show that if (0, €))eH x (WF,(f) U WF(u,)) 
and, say, the corresponding outgoing bicharacteristic is not in WF(u) near 0 
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then (0, €5)¢WF,(u). After multiplying u by a suitable cutoff function we may 
assume that u has compact support and that (x, €)eWF(u), x, >0, implies ’ 
+0, which makes Theorem 18.1.36 applicable. We shall use the factorization 
in Lemma 23.2.8, 


(24.2.5) P(x, D)=(D, —A,(x, D’))(D, — A_(x, D') + w(x, D’) 


where weS~® and the principal symbol of A, is +r* in a conic neigh- 
borhood of (0, €5). There is another such factorization 


(24.2.6) P(x, D)=(D, —A_(x, D’))(D, —A,(x, D)) + &(x, D’) 


where the principal symbol of A, is +r* in a conic neighborhood of (0, £4). 
Assume now that an open interval y, on the corresponding outgoing 
bicharacteristic, with one end point at (0,0,7(0, £4)*, €), is in T*(X)\ WF(u). 
From the proof of Theorem 23.1.4 we know that Q(x, D’) can be chosen of 
order — oo outside any desired neighborhood of {(x, &’); (x,r(x, 4, e7,} 
and non-characteristic at (0,&)) so that [Q(x, D’), D, —A,(x, D’)] is of order 
— oo. Since 


(D, —A,(x, D’)) Q(x, D')\(D, — A_(x, D’)u= Q(x, D') f — Q(x, D’) w(x, Du 
7 [O(x, D’), D, = A (x, D’y) (D, —-A _(x, D'‘)yu 


we have (D,—A,(x,D’))veC” by Theorem 18.3.32 for small x,20 if v 
=Q(x, D')(D, —A_(x, D'))u and Q has sufficiently small support. Hence €, 
=r(x,¢’)* in WF(v) if x,>0 is small enough. This proves that WF(v) is as 
close to y, as we please if Q is suitably chosen. Since WF(v)< WF(u) we 
obtain veC® for small positive x, then. Hence it follows from Corol- 
lary 23.1.3 (see also the discussion after Theorem 23.1.4) that veC®™ for small 
x,20. 

Choose Q(x, D’) commuting in the same way with D, — A_(x,D’) and so 
that O(0,x’,€)=Q(0,x’, €), and set 6=O(x,D')(D,—A,(x,D’))u. For x,=0 
we have —v=Q(0,x’, D')(A_(0, x’, D')—A,(0,x’, D))ugEC® if the support of 
Q is small enough. As in the discussion of v above we also have 
(D, ~A_(x,D))’eC® for small x,20. Hence Corollary 23.1.3 yields that 
beC®@ for small x, 20. Now the equations 


Q(x, D'\(D,-A_(x,D'u=v, Q(x, D'\(D, — A(x, D)u=s, 


can be regarded as an elliptic system for D,u and u. More precisely, if 
WF(a(x, D’)) is contained in a set where Q and O are non-characteristic, we 
obtain, again by Theorem18.3.32, that a(x, D')(D,—A_(x,D’))u and 
a(x, D’)(D, —A,(x, D’))u are in C® for small x, 20. Hence a(x, D’)(A (x, D’) 
—A_(x,D’))ueC® for small x,20. Here the principal symbol 2r(x, ’)? of 
A, —A_ does not vanish at (0, £,). Choosing a(x, D’) non-characteristic there 
we conclude from Theorem 18.3.32 again that (0,,)¢WF,(u). The proof is 
complete. 


426 XXIV. The Mixed Dirichlet-Cauchy Problem for Second Order Operators 


Remark. It is not sufficient to prove that (0,¢,)€WF(D,ul,y) in order to 
make the desired conclusion. (See the remark at the end of Section 9.6.) 


Theorem 24.2.1 suggests that we introduce the following terminology: 


Definition 24.2.2. A broken bicharacteristic arc of P is a map 
IN Bath y(jeT*(X~\0, 


where I is an interval on R and B is a discrete subset, such that 

(i) If J is an interval cI\B then Jatry(t) is a bicharacteristic of P 
over X°. 

(ii) If teB then the limits y(t —0) and y(t +0) exist and belong to T*(X)~\0 
for some xe0X, and the projections in T.*(0X)\0 are the same hyper- 
bolic point. 


Thus we have a bicharacteristic arc arriving transversally to 0X at 
y(t—0) and another going out transversally from the reflected point y(t+0) 
with the same projection in T*(@X)\0. Thus y has an image } in T*(X)\0 
which is well defined and continuous also at the points in B. We shall call 7 
a compressed broken ‘bicharacteristic when a distinction between y and 7 is 
called for. 

Theorem 24.2.1 shows that a compressed broken bicharacteristic arc 
which does not intersect WF,(f)U WF(u,) is either contained in or disjoint 
with WF,(u). We shall devote the rest of this section to constructing exam- 
ples where the first case occurs and there are no other singularities. The 
constructions are related to the proof of Theorem 8.3.8 but modifications are 
required by the presence of the boundary and the variable coefficients. 

At first we shall only make a local construction at a boundary point. We 
choose coordinates there so that the principal symbol has the form (24.2.2). 
Denote by (0,&) a hyperbolic point. We shall then construct u in a neigh- 
borhood of 0 in X, defined by x, 20, so that u=0 on 0X, PueC™(X) and 
WF,(u) is the compressed broken bicharacteristic passing through (0, £9). 
Recall that it is defined by the outgoing bicharacteristic y, starting at 
(0,0, r(0, )*, €>) and the bicharacteristic y_ coming in at (0,0, —r(0, £)*, &). 
At first we look for oscillatory asymptotic solutions of Pu=0 given by formal 
series 


u,(x)=e*FOY a(x)a-s 
7) 


where 06(0,0)/0x'=5. The leading term in Pu, is A?p(x, d’(x))u, so our 
first equation is 

(24.2.7) p(x, ’(x))=0. 

By Theorem 6.4.5 we can solve (24.2.7) near 0 with prescribed initial values 


(24.2.8) $0,x)=Golx); PoO)=Fo, — Ho(0)=0. 
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The equation p(0,0,é,,€))=0 has two solutions €,= +r(0,0,&5)? and this 
gives two corresponding solutions @,(x). For these we have ¢',(x)=¢ if 
(x, €)ey,. Next we obtain the transport equation for the leading term az of 
the amplitude corresponding to $, 


(24.2.9) Y p(x, b: (x) Dag (x) + ¢(x) ag (x) =0 


where c depends on the lower order terms of p also. The j+1st equation is 
a similar equation for a; with an inhomogeneous term depending on 
agj,...,4;_, added. All these equations can be solved in a fixed neigh- 
borhood of 0 with a} prescribed when x,=0. The difference between the 
series uf obtained can then be used as in the proof of Theorem 8.3.8 if we 
take a; with support near 0 when x, =0 and take a large number of terms. 

However, we want to prove a global result. An obvious difficulty is that 
long before the bicharacteristics y, leave the local coordinate patch or 
return to the boundary the functions @, may cease to exist. Still the 
geometrical solutions as Lagrangians given in Theorem 6.4.3 do exist; the 
problem is that the projection of a Lagrangian in X may not have bijective 
differential. We shall avoid this difficulty by using the fact that a strictly 
positive Lagrangian plane in the complexification of a symplectic vector 
space S always has bijective projection on the complexification of a La- 
grangian subspace of S. (Cf. Proposition 21.5.7.) 

First we recall the details of the solution of (24.2.7), (24.2.8) given in 
Section 6.4. Denote by 74(x,,y’,7’) the solution of the Hamilton equations 
with Hamiltonian ¢€, Fr(x, ¢)?, 


dx'/dx,= FOr(x,EV/0E,  d&'/dx, = + Or(x, €)#/dx’, 


and initial data (x’,é’)=(y’,y’) when x,=0. The maps x,(x,,.) are well 
defined and symplectic from a neighborhood of (0, >) to a neighborhood of 
the point 7+(x,,0,¢o)ey4 for fixed x, (and the variable €, dropped). The 
solution of (24.2.7), (24.2.8) is equal to 0 on y, and G,, = {(x’, 0@(x,,x’)/dx’)} 
is the image of Gy = {(x’, dgo(x’)/0x’} under 74(x,,.). The solution continues 
to exist along y, as long as G, a(x’, é’)t>x’' has a bijective differential. 

So far dp has been real, but we shall now allow ¢, to be complex valued 
with the Hessian Im ,(0) positive definite. Then it follows from Proposi- 
tion 21.5.9 that the complexified tangent plane of G, is a strictly positive 
Lagrangian plane. Strictly speaking x4(x,,y’,0@ (y')/Oy) is not defined for y’ 
+0 but the Taylor expansion at y’=0 is meaningful and defines a formal 
Lagrangian at 7,(x,,0,¢5). The tangent plane at this point is the image of 
the complexified tangent plane of Gp, at (0,€ >) under the complexification of 
a real linear symplectic map, hence strictly positive. Let y, be defined for 
0Sx, Sc. In view of Proposition 21.5.9 it follows that we can find ¢,€C™ 
such that 


G1(x)=0, 06,(x)/Ox'=E, 
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Im67,/dx’? is positive definite if (x,4ey, and O<x,<c; moreover, 
66 /0x, r(x, 0¢ ,/dx’)* vanishes of infinite order on y,. Similarly we can 
solve the transport equation (24.2.9) and the subsequent ones so that az +0 
on y, and 


N 
uz y(x)=el*492 aF(x)A~s 
0 


for every N has the property that 
D* Pu y(x)= o(all-9), 
If we choose an asymptotic sum a*(x, 4)eS°(IR" x R), it follows that 
P(ci4#9e™q*(x, J))eS-°(R" x R). 


We can choose the sums so that a*(x,4)=a~(x,4) when x, =0 and take a* 
with support so close to the projection zy, of y, that Im@4(x) is strictly 
positive there except on zy,. 

Now we introduce the absolutely convergent integral 


(24.2.10) U*(x) =f el4¥2 g#(x, A) da/(a? +1). 
r 


By Theorem 8.1.9 we have WF(U*) C IRyy when x, > 0, and PU € C™ 
when x,20 so U*e.¥. The boundary values when x,=0 are equal, so U 
=U*—U~ vanishes when x, =0. If b(x, D’) is a pseudo-differential operator 
with symbol of order —oo in a conic neighborhood of (0,0,é5) then it 
follows from Theorem 8.1.9 and Theorem 18.1.36 that b(x,D’)U*eC® for 
small x,20. Thus WF,(U*)cCR,4,. We shall prove now that (0,4) is in 
the wave front set of 


D(U* — U-V(0,.)= f eee? b(x’, A)dd/(A? +1); 
b(x’, A) =2r(0, x’, 0b, /0x’)* La*(0, x’, 4)+ D, (at (0, x’,A)—a~ (0, x’, d)). 
Here beS' and the leading term 2r(0, x’,0¢,/0x’)*Aa,(0, x’) does not vanish 
x aah Im z>0 we have 
{ Ae da/(22-+1)+loglel=0(1) as 70. 
In fact, the integral is equal to 
{ae ai/a?+ 1? z'=2/\|z\. 


The difference between this integral and 


j 4da(a?+12I*)=410g(1 +|z|7)—log|z| 
9 
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is bounded as z—0 since |e'*”’ —1|<jA| and the integral from 1 to oo is seen 
to be bounded by a partial integration. Hence 


D,(U* —U~)(, x!) +270, 64 9(0)/8x’)? ag(0, 0) log |Po(x’)| 


is bounded as x’>0 so D,(U* —U~)(0,.) is not even a continuous function. 
In particular (0,€5) must be in the wave front set since it can only contain 
points along this ray by Theorem 8.1.9. This completes the proof that WF,(u) 
contains (0, &), hence also contains the bicharacteristics which are incoming 
and outgoing there. - 

We are now ready to prove 


Theorem 24.2.3. Let [a,blatte f(t) be a compressed broken bicharacteristic 
arc which has an injective projection to the compressed cosphere bundle 
T*(X)/R., and end points $(a), 5(b) over X°. Assume that the projection of 
H (f(t) in T(X°) is never 0 when }(t)eT*(X°). Then one can find ue V(X) so 
that WF,(u) is the cone generated by j([a,b]), WF,(Pu) is generated by 
F({a, b}), and u=0 on OX. 


Proof. Assuming that # contains some point over 0X we introduce local 
coordinates there and start the construction as above. (The argument just 
given could also be used to prove directly that yz C WF(U*) when 
0<x,<c, so we could also start with constructing just U* or U~ near a 
point in X° where x, can be used as local coordinate on the bicharacteris- 
tic.) The construction can be continued from x, =c with another set of local 
coordinates such that x, is a good parameter on the bicharacteristic strip. 
In fact, changing coordinates in the phases @ and the amplitudes a, we get 
initial data to use for the new coordinate patch, and if the bicharacteristic 
strip arrives at the boundary we obtain initial data for the construction of a 
reflected wave. There can never be any interference between different pieces 
of the constructed solution since the wave front sets are different by the 
assumed injectivity of the projection to the compressed cosphere bundle. 

The only remaining point is to show how the construction can be cut off 
at x,=c if that corresponds to an end point of #. To do so we choose 
KE CHR) equal to 1 in (—0oo, —2) and 0 on (—1, 00) and replace a*(x, 4) by 
a*(x,A)K((x,—c)?A), which is in Sha (cf. Example 18.1.2). This does not 
affect the singularities when x,<c but makes U*=0 when x,>c. Theo- 
rem 8.1.9 is applicable everywhere so WF(U*) is still generated by -y, for 
x; > 0. Now WF(PU*) can only contain the ray defined by the boundary 
point of +, with x; = c, and this point cannot be missing for the interior 
propagation theorem would then imply that 7, does not meet WF(U*). The 
proof is complete. 


Remark. Changing the order of a*(x, A) by multiplication with a power of 
A+1 we could construct u with any desired regularity, as we shall do in the 
analogous Theorem 26.1.5 below. 
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The functional analytic arguments in the proof of Theorem 8.3.8 can be 
used to show that also limits of the arcs 7 in Theorem 24.2.3 can carry 
singularities. This will be done in Theorem 24.5.3 below. However, first we 
shall discuss the properties of such limits in Section 24.3 and prove converse 
results analogous to Theorem 24.2.1 in Sections 24.4 and 24.5. 

In Theorem 24.2.3 we have assumed that H, is not a tangent to the fiber 
of T*(X), but this is not essential. To remove this hypothesis one can use 
the theory of Fourier integral operators as in the proof of Theorem 26.1.5. 
Alternatively one can interpret U as a Lagrangian distribution associated 
with a positive Lagrangian. In either approach the methods and notions of 
Chapter XXV are required. 


24.3. The Generalized Bicharacteristic Flow 


In Section 24.2 we have seen that singularities of solutions of the Dirichlet 
problem arriving at the boundary on a transversal bicharacteristic will leave 
again on the reflected bicharacteristic. To prepare for the study of tangen- 
tially incoming singularities we shall now examine the geometrical proper- 
ties of broken bicharacteristics nearby. In doing so it is natural to start from 
a general symplectically invariant description of the situation and then 
introduce suitable local coordinates. 

Thus we start from a symplectic manifold S of dimension 2n with 
boundary 0S, and choose ¢eC™(S) so that ¢=0 on 6S, ¢>0 in SOS, and 
d@+0 on OS. Let peC*(V) where V is an open neighborhood in S of a 
point s)¢dS, and assume that 


(i) p={o,p}=0, {4,{b,p}}+0 at so. 
Usually we shall also require that 
(i) dples+0 at So. 


Condition (i) means that p restricted to the fiber of the natural fibration of 
OS (by the integral curves of H,) has a double zero at sy. Another in- 
terpretation of the condition {¢,p}=0 is that the bicharacteristic of p 
starting from s is tangent to 0S at sg. We shall study the bicharacteristic 
foliation of p~*(0), broken by reflection in @S as in Definition 24.2.2. 

Shrinking V if necessary we may assume that V c T*(R",), s)=(0,0) and 
~=x,. Then (i) means by the Malgrange preparation theorem that with 
g+0 and a=r=0 at 0 


P(x, C)=9(x, G)(E, +a(x, EI)? r(x, 2), = (Ea5--- Syd 


We can take new symplectic coordinates with y, =x,, 4,=¢,+a(x, €’). Since 
0=H,,r=H,,r it follows that with the new coordinates p has the same form 
but with a=0, as we assume from now on. The factor g does not affect the 
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set p~+(0) or its foliation, just the parameter on the bicharacteristics, so we 
shall assume that g=1, thus 


(24.3.1) P(x, Q)=Fi—r(x, 2), C=(E2,--.58,), 


where d,, ..r(0,0)+0 if condition (ii) holds. This is the same form of p as the 
one which we obtained in Section 24.2 for the principal symbol of a second 
order differential operator in a manifold X with non-characteristic bound- 
ary 0X. 

Definition 24.2.2 has a symplectically invariant meaning in a neigh- 
borhood of sp: y(t) shall be a bicharacteristic of p contained in SX 0S when 
t€éB, and when teB the limits »(t+0) shall exist and belong to the same leaf 
of the natural foliation of 0S. 

We shall write Y=p~‘(0) for the characteristic set and Y for the com- 
pressed characteristic set obtained by identifying points on the same leaf of the 
foliation of 6S. Thus x, € are continuous functions on XY and define the 
topology there while x, x,€,, ¢’ are continuous on x and define its to- 
pology. The subset of 2 where x, =0 is denoted by 2); it is the image of all 
(x,é) with x,=0 and €?=r(x,2’)20. The set Gc where x,=f,=0 is 
called the glancing set; it is the set of all points in ¢~ (0) p~*(0) where H, 
is tangent to OS. It is mapped topologically into XY so it can be identified 
with its image there, and H=2 )\G is the hyperbolic set with the termi- 
nology of Section 24.2. We can aise define invariantly the glancing set G* of 
order at least k,k=2,3,..., by the equations 


(24.3.2) p=0 and Hip=0 for 0Sj<k. 


Thus G=G?>G?>...>G® are closed sets. The definition depends only on 
p~1(0) and @S, for p and ¢ are determined by p~‘(0) and dS up to smooth 
factors +0 which only affect the parametrization on the bicharacteristics 
and not the order of the zero of ¢. When ¢(x)=x, and p is given by (24.3.1) 
we set 7,(x’, ¢')=6/r(O, x’, €)/6xi, j =0, 1, and shall prove: 


Lemma 24.3.1. Let k be an integer =O. Then y,=(0,x’,0, &')eG**? if and only 
if 

(24.3.3) r=0 and Hir,=0, O<j<k, 

at Yo, and then we have at yo 

(24.3.4) H**? 6 =2(—H, Jr 


Proof. With H° denoting the Hamilton field dr/0&’0/0x’ —dr/éx'd/éf of r 
for fixed x,, we have 


H,=2€,0/8x,+14)0/0€,-H?, 4) = Or/Ox,. 


Hence H,x,=2¢,, H}x,=2r,, which proves the lemma when k=0. By an 
obvious inductive argument it remains only to show that (24.3.4) is valid if 
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k>0, yo¢G**? and (24.3.3) holds. By hypothesis Hiy=0 at yo for j<k+1 if 
y=0 when x, =0. Thus we have at y, 


Hkt7@ =Hf2r, =2(2€,0/dx,+1r,0/0€, —H,,)*r;, 


for working from right to left we can successively put x,=0 in each of the 
factors 7,, and H, since at most k factors H, act from the left. We can now 
drop 0/0x, since nothing depends on x,, and then 0/0f, because €, no 
longer occurs. This completes the proof of (24.3.4) and of the lemma. 


The bicharacteristics of p starting at a point yp with ¢=p=H,@=0 but 
H>>0 will lie in the interior defined by #>0 in a deleted neighborhood 
of the starting point y,. On the other hand, if p is extended to negative 
values of x, then the bicharacteristic will immediately leave the half space 
20 if H? <0. This convexity (concavity) of 0X with respect to tangential 
bicharacteristics leads to drastic differences in the behavior of the broken 
bicharacteristic flow. We begin the discussion by summing up and extending 
our terminology: 


Definition 24.3.2. The glancing set G* of order at least k2=2 is defined by 
(24.3.2); G=G? is just called the glancing set. The glancing set G?\ G? of 
order precisely 2 is the union G,UG, of the diffractive part G, where 
H}¢>0 and the gliding part G, where H?<0. 


With the notation in Lemma 24.3.1 the diffractive (resp. gliding) sets are 
defined by x, =€,=r)=0 and r,>0 (resp. r, <0). To investigate the broken 
bicharacteristics near G, and G, it is by Theorem 21.4.8 sufficient to study 
the following example provided that (i) and (ii) hold. 


Example 24.3.3. If p(x,é)=€7+x,+6,, $(x)=x,, then the glancing set is 
defined by x, =¢,=¢,=0, and H}¢= 2 so it is gliding for the upper sign 
and diffractive for the lower sign. The invariants of the Hamilton flow are 


Nerroig Mp nas 6 gs veer Gy ake nds Cex. 


The broken bicharacteristics are displayed in the x,x, plane for the diffrac- 
tive and gliding cases in Fig.3 and Fig.4. We have €,=0 when dx,/dt=0. 
In the diffractive case the only limits of broken bicharacteristics are the 
tangent parabolas defined by €7=x,, €,=0, €; —Xyqs X25 00-3Xp—q> Case Ena 
all constant. In the gliding case €7+x, is equal to the maximum value c? of 
x,. Thus |&,|Sc, O0Sx,Sc?, &,=—c? and the different parabolas in Fig. 4 
differ only by a translation. 

When c-0 the broken bicharacteristic degenerates to a line dx/dt 
=(0,...,0,1), dé/dt=0 in the gliding set, defined by the Hamilton field of €,. 
This is called a gliding ray. 


Instead of using the fairly deep result in Theorem 21.4.8 and in order to 
avoid using condition (ii) we shall now show directly that the conclusions in 


24.3. The Generalized Bicharacteristic Flow 433 


x Xn 


x; x 


Fig. 3 Fig. 4 


the example are valid quite generally. As usual V is an open set in the half 
space defined by x,20 such that V is compact, and p is defined in a 
neighborhood of V in this half space. 


Lemma 24.3.4. Assume that ér/6x,2c>0 in V and that try(t), 0<|t|<T, is 
a broken bicharacteristic reflected when t=0. Then x,=ct? so y cannot return 
to 0S when maximally extended. 


Proof. Since dé,/dt=0r/0x,2c and €,(+0)20, it follows that €,()>ct if 
t>0, €,()<ct if <0. Now dx,/dt=2é, and x,(0)=0 so it follows that 
x,(t)2ct?. 


From the lemma it follows that a limit of broken bicharacteristics is 
either a broken bicharacteristic or else a bicharacteristic starting from a 
point in the glancing (thus diffractive) set. 


Lemma 24.3.5. Assume that 0r/0x,<—c<0O in V and that y is a broken 
bicharacteristic. Then there are constants C, and C such that 
(24.3.5) E(t)? +x, (DS Coe '-(E,(s)? +x, (5) 


for any two points on y. 


Proof. For g(x, €)=€? —x, 6r/0x, we have 
ef tex, Sex, )scit+Cx, 
if @r/oéx,2—C' also. At the points of reflection g(x, €) is continuous, and 


elsewhere we have 


dg(x(t), €(t))/dt =2€, or/Ox, —2€, ér/dx, —x, d(ér/0x,)/dt. 
Hence 


|dg(x(t), ¢())/dt| < Cg(x(0), ¢(0), 
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which implies that 
g(x(t), Ed) Se"! g(x(s), &(s)). 
(24.3.5) follows at once with Cp=C'/c if ce<1<C’. 


If |E,(0)|<e, x,(0)<e’, it follows for t in a bounded interval that €,(¢) 
= O(e), x ,(t)=O(e”). Thus 


d(x’, €)\/dt = — H? = —H,, + O(e), 
which implies that 


(x'(), £0) =exp(—tH,,.)(*'), €(0)) + Ole”). 


A sequence of such broken bicharacteristics y,(t) defined for |t}<T and 
converging for t=0 to a point (0,x5,0,& 5) with ro(xp,¢5)=0 will therefore 
converge uniformly for |t|<T to the gliding ray which is the integral curve 
of —H,,. (These give a foliation of the glancing set defined by x,=€, =" 
=0 wkave H,,+0. When H,,=0 the gliding ray does not move.) By the 
differential equation (xi, & () i is in fact C' convergent. 


In the glancing set we have 
H,=1r,0/05,—H,,, H,=—0/08). 


Thus —H,,=H,+AH, with 4 chosen so that this vector field is tangential 
to the glancing set x,=¢,=r)=0. Invariantly this set is defined by p=@ 
={p,¢}=0, so A must be chosen so that H7$+1H,H,¢=0, that is, A 
=H 2 b/H3p. 


Definition 24.3.6. The vector field 
HS =H,+(H?$/H3p)H, 
tangent to G is called the gliding vector field. 


An alternative interpretation is that Hf is the Hamilton field of p 
restricted to the symplectic space where ¢= {p, 6} =0. In fact, with our local 
coordinates this is defined by x,=¢,=0, and p restricts to —r,. Note also 
that H@=H, in G° so the vector field which is H? in G\G, and H, on G, 
is continuous. He is independent of the choice of ¢, as the notation 
suggests, and if p is replaced by gp, g+0, then Hy © like H, is just multiplied 
by g. This gives a consistent change of parametrization in the following 
definition suggested by Lemmas 24.3.4 and 24.3.5, 


Definition 24.3.7. A generalized bicharacteristic arc of p is a map 
IN Batrey(te(S\ 0S)UG 


where I is an interval on R and B a subset of J, such that poy=0 and 
(i) y(¢) is differentiable and y'(t)= H,(y(0) if pOeS~OS or y(HeG,. 
(ii) y(t) is differentiable and y'(t)=H a(y(t)) if y(thEG~ G,. 
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(iii) Every teB is isolated, y(s)eS~ 0S if s+t and |s—t| is small enough, 
the limits y(t+0) exist and are different points in the same (hyperbolic) fiber 
of OS. 

The continuous curve } obtained by mapping y into Z is called a 
compressed generalized bicharacteristic. 


The definition has been stated in an invariant form, but condition (iii) 
assumes tacitly that p can be given the form (24.3.1). From now on we 
assume that we have such coordinates and set y(t)=(x(t),é(t)). Then x’(t), 
é(t) are continuously differentiable in J, 


dx'/dt=—dr/0@, dé'/dt =0r/dx’. 


x, is continuous and left and right differentiable with derivative 2£,(t + 0) 
also if t¢B. Even then the right and left derivatives 


lim (&,(t+e)—€,(t +0) (+6) 
e>++0 


exist and are equal to dr/dx, except in G, where the derivative is 0. In 
particular we obtain a uniform Lipschitz condition for x(t), x,(t)é,(t) and 
é(t) when y(t) remains in a fixed compact set. Since ¢? =r(x, €’) we also have 
a uniform Lipschitz condition for €7 and a Hélder condition of order 4 for 


i . 
If y(O)e(SN. 0S)UG, then »(t) remains in this set for small t. In fact, if 
y(O)eG, then there can be no reflection point y(t) for small t, by Lem- 
ma 24.3.4, and y(t)}€G~ G, since this is a closed set. Hence it follows from 
condition (i) that tr y(t) is for small t just the orbit of H, through »(0). 

If y(O)eG, then y(t) is for small t the gliding ray through (0). In fact, 
there is a neighborhood of y(0) where (24.3.5) is valid. If x,(t) and &,(t) do 
not vanish identically for small t it follows that y(t) is a broken bicharacter- 
istic and we get a contradiction when t-0. Thus y(t)€G, and the assertion 
follows from condition (ii) in Definition 24.3.7. 


We shall now study the generalized bicharacteristic when »(0) is close to 
G?. Let (y(t), 7'(t)) be the solution of 


dy, n)/at= —H,(y,0);  yO)=x'0), — n'O)=¢'); 


that is, the orbit of the gliding vector field with the same initial data. 
Subtracting from the Hamilton equations for x’,é’ we obtain if y(t) is 
contained in a fixed compact set for O<t<T 


ld(x’—y’, ¢' —nV/dt| S Co(lx’ -y'|+1F — n+ yxy. 
With f()=|x'()—y'(Ol+|€(0—n'(O] it follows that 
df W/dtSCof(+C,x,0, f(0)=0. 
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Set e(t)=r,(y'(), n'() ="), yO, (0) where r/,)=0r/0x,. Then 
Inay<(0), F()-eMl SC, f(H+ Cx, (0), 
which means that for both left and right derivatives 
a|¢ ,(t)\/dt Sle(t)| + Cz f(t)+ C3x (0). 


Furthermore we have dx ,(H/dt$2|¢,(|. 
Consider now a solution X,=, F of the differential equations 


(24.3.6) dX/dt=28, d&/dt=|e(t)|+C,F+C,X, dF/dt=C,F+C,X, 


with X(0)>x,(0), 2(0)>|€,(0)|, F(O)>0. We claim that x,<X, |€,|<5 and 
f<F in [0,T]. In fact, if s is the largest number <T such that this is true 
in [0,s) then X—x,, E—|€,|, F—f are increasing in [0,s), hence positive at 
s, which proves that s=T and that the inequality is valid in [0,7]. Hence 
we can also estimate x,, |¢,|, f by the solution of the equations (24.3.6) with 
X(0)=x,(0), 2) =|¢ ,(0)|, F(0)=0. 

Let F, X, = be the solution of 


dX/dt=28, d5/dt=C,F+C,X, dF/dt=C,F+C,X 


with X(0)=F(0)=0 and 3(0)=1. Then X(t)=2t+ O(t?), F(t)=C,t? + O0(t) as 
t—0, and (X(t), S(t), F(t))=O(e") as t-+00. Differentiation gives a solution 
of the same system with initial data (2,0,0), and the two solutions allow us 
to solve (24.3.6) with initial data x,(0), |é,(O0)|, 0. This proves that for 
O<t<T 


t 


x,(t)< Ce" ( (t—s)|e(s)|ds+x,(O)+ H1E,(0)) 


0 


(243.7) —‘|é,()1S Cet! (5 le(s)| ds-+tx,0)+12,0)1), 
fs Cet ( (ts)? le(s) ds-+tx,(0)+1712,0)). 
0 


Let us now assume that y(0)eG*, k>2. By Lemma 24.3.1 this means that 
e(t)=O(t"—*) as t>0. From (24.3.7) we then obtain that x,(t)=O(t*) and 
that €,(t)=O(t*~*), just as we would have for the orbit of H, starting at y(0) 
if no boundary interfered. This is far more than differentiability of x,(t) and 
é,(t) at 0. 

Assume for a moment that y(0)eG*'\G**t! for some k>2. Then it 
follows from Lemma 24.3.1 that 


e(t)=at*-?+0(t*-1), — e’(t)=a(k—2)t*-3(1 + Ot) 


where 2a=H* b(y(0))(k—2)! +0. Hence e is monotonic for t+0 small 
enough. This monotonicity alone suffices to determine y(t): 


24.3. The Generalized Bicharacteristic Flow 437 


Proposition 24.3.8. If y(0)eG*> and e(t) is increasing for small positive t then 
y(t) is for such t the orbit of H, starting at y(0). If y(0)eG? and e(t) is 
decreasing for 0<t<T then y(t) is a gliding ray for OStST. 
Proof. Assume that e(t) is increasing for OSt<T. With the notation above it 
follows that for O<$t<T 
x,()=O(7e(t), E()=Ote(t), fM=O(t ele), 
Kay(x(t), (2) —e(t) =O e(t)). 

Hence dé,/dt=r,(x(d),¢())20 (for left and right derivatives) if t is small 
enough. Since jumps of €, at reflections must be positive it follows that €, is 
increasing. If e is equal to 0 for O<t<T then there can be no jumps and x, 
=¢,=0 so we have a gliding ray in G? which is also an orbit of H,. On the 
other hand, if e is not equal to 0 for all small positive t then €,(t)>0 for 
t>0 so x,(t) is strictly increasing. Thus y(t)eS~ dS so y(t) must be an orbit 
of H,. 

Now assume that e(t) is decreasing for O$tST. In view of the discussion 
above we may assume that e(t) does not vanish for all small t, thus e(t)<0 


for 0<t<T. Then 
ray(x(t), &'(t)) =e(t) + O(t7 e(t)) <0 


if t is small enough. As in the proof of Lemma 24.3.5 we introduce 
g(t)= &,(t)? —x,(t) Nyx (0), ¢(t)) 
and observe that 
g'(t)= —x (dn, (x0), o'()/at. 
Since 
e'(th=d " 19, y'(0), n'(0)/dt, 
x(Q=O(t7e(t)), f(H=O(e(t), dx ,(Q/dt=2E (4) =O(ce(), 
d(x'(t)—y'(t), 2()—n'(o)/dt = O(t? e(t)), 
in view of the Hamilton equations, we obtain 
dry (x(t), o(1)/dt —e'(t) = O(te({t)). 
By the obvious estimate g(t)= —x,(t)n,,(x(0), ¢'()) we have 
g'(t)Sa(t)(—e'(t) + O(te(t))(—e(t) + O(t e(e))) 
Sg(t)(2e(t)/e(t)+ Ct) 
when t¢ is so small that |O(t? e(t))| <|e(t)|/2. Hence 
d(g(t)/e(t)”)/dt =(g'(t)—2a(the'(t)/e(t)\/e(t)? S Ct g(t/e(t)’. 
Since g(t)/e(t)? =O(t?)>0 when t-—0 it follows by integration of this in- 
equality that g(t)/e(t)? =0 identically. The proof is complete. 


We can of course argue in the same way for t<0. In particular, we have 
therefore proved: 
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Theorem 24.3.9. Let tt y(t) be a generalized bicharacteristic, let y{ty)eG\ G® 
and denote by y, the gliding ray with y,(to)=y(to). Then it follows that in a 
one sided deleted neighborhood of t) we have either y,(t)eG, and »(t)=y,(t) or 
else y,(t)eG, and y(t)=exp(t H,) (to). 


Corollary 24.3.10. A generalized bicharacteristic with no point in G® is unique- 
ly determined by any one of its points. A generalized bicharacteristic is 
constant if it contains a point in GN G, where H¢=0. 


The following example shows that there is not always uniqueness when 
the monotonicity in Proposition 24.3.8 fails: 


Example 24.3.11. We shall construct below a C® convex curve y= f(x) in 
IR? such that f(x)=0 for x<0, and an inscribed polygon with vertices 
converging to 0 which satisfies the reflection law. This condition means 
precisely that with X = {(x, y,t)eIR?; y= f(x)} the polygon with t defined as 
arc length and the dual variables €, y, t defined by t= —1 and (&,4)= 
(dx/dt,dy/dt) is a broken bicharacteristic for the differential equation 
D? —D —D? which approaches a point (0, 0, t, ~1,0, —1)eG® without being 
equal to the gliding ray. To construct the curve and the polygon we first 
choose the polygon with the corners 


(x, ¥)= a (cos a, Sin a;) 
J= 

where «, is decreasing and tg ((«,—0,;, ,)/2)=277. Thus the tangents of half 
the outer angles are 2~/. Choose a C® convex function g(x) such that g(x)= 
~—x for x<0O and g(x)=2(x-—1) when x>I1i. Then the graph of 
(0,k~*)axt+2-"g(k?x)/k? rotated by the angle «, and translated to 
(Xy+1>Ye41) defines the desired curve if k2k,. We can extrapolate linearly 
to the right of the first interval. The curve is then in C® for x>0 and all 
derivatives of f tend to 0 when x-0 so feC™(R) if we define f(x)=0 
when x0. 


We motivated the definition of generalized bicharacteristics by studying 
limits of broken bicharacteristics at G, and at G,. It will now be proved that 


Fig. 5 
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limits of broken bicharacteristics are indeed generalized bicharacteristics. 
More generally, assume that we have a sequence of generalized bicharacter- 
istics [a,b]atry’(t)eK where KS is a compact set. If y’(t)=(x"(1), (0) 
we have already observed that x’(¢), x}(t) €}(t), €7(0)? and €”(t) are uniformly 
Lipschitz continuous. For a subsequence these functions are therefore uni- 
formly convergent. 


Proposition 24.3.12. Let [a,b]atrey’(j=(x"(t), €() be generalized bicharac- 
teristics such that x(t), x3(t) Ey, E1(t)? and &°(t) are uniformly convergent 
when v— 00. Then there is a unique generalized bicharacteristic [a, b]atr> y(t) 
such that y’(t)— y(t) uniformly for t in any compact subset of [a,b] containing 
no reflection point of y(t). 


Proof. Let ty be a point such that the limit of xj/(f) is not 0. Then y(t) 
=lim y’(Q\eES~ OS exists uniformly in a neighborhood of ty. From the Ham- 
ilton equations it follows that we have C! convergence and that y(t) is also 
an orbit of H,. Now let ty be a point such that x{(t9)>0 but the limit of 
Et(to)? is not 0. Since dx}(t)/dt=2€{(t) it follows from the implicit function 
theorem that there is a sequence t,>t, such that x}(t,)=0. In a neigh- 
borhood of t, independent of v it follows that y’(t) is for t<t, the orbit of 
H,, which comes in at y"(t,—0) and for t>t, the orbit which goes out at 
y(t, +0). This proves that y(tg+t)=limy(t,+2) exists for fixed small t+0, 
thus with C! convergence for small t>0 or t<0; the limit is a broken 
bicharacteristic. 

Let T be the closed set of all te{a,b] such that xj(t) 0 and €7(t) 00. Then 
y(t)=lim y*(t) exists uniformly for teT, and y(t)eG. We have already proved 
the existence of the limit at all points in [a,b] T except for the countably 
many reflection points. If y(tp)eG, or G, for some t)€T, it follows from 
Lemmas 24.3.4 and 24.3.5 that y’(t) converges uniformly in a neighborhood 
of fy to the orbit of H, resp. the gliding ray through y(t). If y(tp)eG? we 
obtain by applying (24.3.7) to y(t—t,) and letting vooo that x,()= 
O((t —to)°), €,(t) = O((t —t,)”) so x,(t) and €,(¢) are differentiable with derivative 
O at %. From the Hamilton equations for x”, ev’ it follows that x’(t), €’(t) 
are everywhere differentiable with derivative —H,°(x(t), €(t)) which proves that 
7 is a generalized bicharacteristic. 

For any e>0 we have |y*(t)—y()|<s, teT, if v>v(e). By the uniform 
continuity of x}, |€}|, €” it follows that |y*(t)— y(0)|<2e if v>v(e), for all t in 
a neighborhood V of T independent of v. In any compact subset of 
[a,b]~ V containing no reflection point of y(t) we have uniform convergence, 
which completes the proof of the stated uniform convergence of y”. - 


Our next aim is to show that through every point in 2 there is a 
generalized bicharacteristic which is a limit of broken bicharacteristics. To 
do so we must make some preliminary remarks on the bicharacteristic flow. 
Let peC™(S) where S is a symplectic manifold, and let sy, s, be points in S$ 
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with p(so)=p(s;)=0 and exp(toH,)sy)=s,. Then one can choose neigh- 
borhoods S, and S, of s, and s, such that 


{(s',s")ESq x S,; p(s')=p(s")=0, s” =exp(tH,)s’ for some t close to to} 


is a canonical relation. In fact, we can choose symplectic coordinates y, 7 
vanishing at s, such that p=, in a neighborhood. Then the pullbacks (x, &) 
of (y,n) by exp(t)H,) are symplectic coordinates at sy which vanish at so, 
and p=O is equivalent to €,=0 there. (See Section 6.4.) The relation is then 


{(x, S35 y.n)3 €, =n, =0, x;=y, and ¢,=n, for j+1} 


which is obviously canonical. (See also Proposition 26.1.3.) 

Let us now return to the Hamiltonian (24.3.1) in a convex neighborhood 
of 0eT*(IR".) where we assume that 0p/0é,>0. This implies that dx,/dt>0 
for the (generalized) bicharacteristics and also that the map 


{(2e, £)5 p(x, 6) =O} 30%, Cr 4, 0 Xn a Sas Sn ET AR") 


is injective and has bijective differential for fixed x,. The pullback of the 
symplectic form in T*(IR") by the inverse map is the symplectic form in 
T*(R"~'). For fixed s, teR let yi map (x4,...,%, 4) €45--00E,-1) to 
(Vio-+*sVn—-a Mis -++> In_1) if there exist some €, and 7, such that p vanishes at 
the point (x,,...,X,_4) 8, 4,---56,)€S and the orbit of the Hamilton field H, 
Starting there arrives at (y,,..-,Ya_y> t, 1.---.%,) Without reaching the 
boundary of S. This is a canonical transformation for it is obtained by 
restricting the canonical relation described above to fixed x, and y,. For the 
same reason the map y,(X,,..-,X,_15 €45++-»,-1)=(’,7) defined when the 
bicharacteristic first meets the boundary transversally at (0,y, 
+r(0, y’,n’)?, 7’) is also a canonical transformation. By composing such in 
and outgoing canonical transformations we obtain 


Proposition 24.3.13. Let p be given by (24.3.1) in a convex neighborhood S of 
OeT*(R",) where dp/dé,>0. For s,teR define 


Mois ov Meanie 959 Gg 1) — Osos Vy cite Wists Naa) 


if there exist some €, and n, such that (X4,..-,Xq_158)645-++50,) and 
(Vis -+esVa—a9 te Ms -+e9M,) are zeros of p in S connected by a broken bicharac- 
teristic arc in S. If s and t are small enough then ¥, is a symplectic transfor- 
mation defined in an open set which contains a neighborhood of 0 in T*(IR"~') 
apart from a set of measure 0. 


Proof. When (xj, ---,%n—12619---s€n—1) iS defined, the broken bicharacteris- 
tic considered has a finite number N of (transversal) reflection points. By the 
implicit function theorem we get such a broken bicharacteristic for all 
(x{,.+-»& 1) in a neighborhood of (x,,...,¢,_,); as the composition of N+1 
C® symplectic maps in T*(R"~') the map 7% then defined is C®” and 
symplectic. By the uniform continuity of x, |é,|, & on generalized bicharac- 
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teristics we can find 6>0 such that for |s|+|t|<6 the map 7%, is defined and 
uniformly bounded in the set A of all (x,,...,¢,_,) with |x,[+...+1&,_.|<6 
unless the broken bicharacteristic y starting there comes arbitrarily close to 
a point in G before x,=t. Choose ¢ so that (t—s)/e is a large positive 
integer, and let A, be the set of points in A such that x, Se” and |é,|<e at 
some point y(t). Then it follows from (24.3.7) that x, < Ce’, |€,|< Ce at y(ve) 
if v is the largest integer with ve<t. The measure of all (x,,...,¢,_,) with 
O<x,<Ce’, |E,|<Ce and fixed bounds for x,,...,€,_, is O(e*) and it is 
equal to the measure of the inverse image under the symplectic map 7\°. If 
we sum for all positive integers v<(t—s)/e it follows that the measure of A, 
is O(c”), hence () A, is of measure 0, which completes the proof. 


é>0 


Corollary 24.3.14. Under the hypotheses in Proposition 24.3.13 there is some 
5>0 and a neighborhood V of 0 in T*(IR",) such that for every yyeVp7*(0) 
there is a generalized bicharacteristic (—6,5) > t + y(t) € S with y(O) = ‘. (If 
¥(t) has a reflection point for t = 0 this means that (+0) = ‘Yo or y(—0) = 0.) 


Proof. By Proposition 24.3.13 one can choose V and 6 so that there is such a 
broken bicharacteristic for every yyeVnp~'(0) outside a null set in the 
manifold p~1(0), even for fixed x,. Since these broken bicharacteristics are 
uniformly continuous it follows from Proposition 24.3.12 that there is a 
generalized bicharacteristic for every yo. 


Remark. From this corollary and the standard existence theorem for the 
vector field H, in the interior of S one can conclude the same global 
existence theorems for generalized bicharacteristics as for bicharacteristics 
on an open manifold. The proof can also be used to give generalized 
bicharacteristics which are globally limits of broken bicharacteristics. In 
view of Coroilary 24.3.10 it follows that every generalized bicharacteristic 
which does not intersect G® is a limit of broken bicharacteristics. However, 
it is an open problem if every generalized bicharacteristic can be obtained 
as such a limit. 


We shall now prove a stronger existence theorem for generalized bichar- 
acteristics which will be essential in the proof of an extension of Theorem 
24.2.3. 


Lemma 24.3.15, Let the hypotheses of Proposition 24.3.13 be fulfilled, and let 
F be a closed subset of S\p~'(0) such that 

(i) If yer (G,UG*) then F contains a neighborhood of y on the broken or 
diffractive bicharacteristic through y. 

(ii) For every compact subset K of F A(G,UG") and e>0 there exists 
some 6>0 and y,(t, y,)eF, —OStS6, yyeK, such that 


(24.3.8) I(velt, Yo) —Yo/t Hp Yoll'<e,  O< |e SS. 
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Here HY (yo) is the tangent of the gliding ray, and |v|' denotes for veT*(IR") 
=]R”" the Euclidean length when the €, component is dropped. Then there 
exists for every y,€F a generalized bicharacteristic y(t), t_<t<t,, such that 
t_<0<t,, pO=Yo, y((t_,t,)) CF, and y(t) leaves every compact subset of F 
when tt... 


Proof. Since dx,/dt=@p/dé,>0 on a generalized bicharacteristic it is suf- 
ficient to show that for every y, in a compact set KF there is a generalized 
bicharacteristic through y, with end points on the boundary of K in F. To 
do so we first choose 6 as in condition (ii) for some small ¢>0, and then 
define an approximate generalized bicharacteristic as follows. If yp¢G, UG? 
we define y,(t) as the (broken) bicharacteristic of p with y(0)=y, as long as 
it exists and is contained in K. We are then through for positive t unless we 
get y-(t) defined just in an interval 0 < t <r. The limit of y,(t) as t — 7 must 
then exist and belong to G, U G?. Now we define ¥,(t) for 7 St $7 +6 by 


yelt)= 7, (t) +(7,(6, Ye(t)) —y_(t)) (€-1)/6 


where we have used the function in condition (ii), Continuing in this way 
also for negative t we arrive at the boundary of K in F at some bounded 
time t since we have a positive lower bound for dx,/dt on the curve 
[t_(e),t,.(e)]atry,(t) obtained. The end points of the curve are thus on the 
boundary of K in F. In view of the uniform Lipschitz continuity of the 
coordinates other than €, we can choose a sequence ¢,->0 such that a limit 
y(t)eF exists as in Proposition 24.3.12. Since G, UG is closed it follows as 
there that y is a broken bicharacteristic elsewhere. If y(t)eG, then y(s) must 
be the oliding ray through »(t) for small |s—d|. In fact, if v(s)¢G, for some s 
in any neighborhood of ¢ then it follows from Lemma 24.3.5 that we must 
have a broken bicharacteristic there. Thus y(s)eG, for small |s—t| so x,(s) 
=€,(s)=0 then, and 
d(x'(s), ¢(s))/ds = — H,,(x'(s), ¢(s)) 


since for y, the derivative of these components is —H°(y,(s)) or —H,,(y,(s) 
+0(6)) except at finitely many points. Finally, if y(t)eG* we obtain in the 
same way that (x(s),¢é'(s)) is differentiable at ¢ with the derivative 
(0, — H,,(x’(t), ¢’(t))). It remains to prove that €,(s) is differentiable with de- 
rivative 0 at t, that is, that &,(s)? =r(x(s), €’(s))=O((s—t)”). Now 


Ir(x(s), €'(s)) | +1 %1ye(s), 6's) = O(s — 2) 


since the left-hand side is Lipschitz continuous and vanishes at t. The right 
and left derivatives of r(x(s),é’(s)) are — 74)(x(s), ¢'(s))dx,/ds since 
d(x'(s), '(s)/ds = — H°(x(s), €’(s)) which makes the other terms drop out. 
Here the left and right derivatives dx,/ds are equal to +€,(s)= 
+r(x(s), &’(s))? so the derivatives of r(x(s), &’(s)) are O(\s —t|*). Thus 


r(x(s), €(s)) = O(|s—t\?) 


which completes the proof. 
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In Sections 24.4 and 24.5 we shall respectively verify conditions (i) and 
(ii) in Lemma 24.3.15 for WF,(u) when u satisfies a second order differential 
equation with smooth right-hand side and has smooth boundary values; 
condition (i) is then already proved in part by Theorem 24.2.1. 


24.4. The Diffractive Case 


As in Section 24.2 we assume that P is a second order differential operator 
with real principal symbol p and C® coefficients defined in a C® manifold 
X of dimension n, with non-characteristic boundary 0X. The present section 
is devoted to the proof of the following analogue of Theorem 24.2.1. Recall 
that the diffractive set G,<T*(0X) can be identified with a subset of 
T*(X)lox- 


Theorem 24.4.1. Let u, fe.(X), upéD(OX), and assume that 

(24.4.1) Pu=f in X~O0X, u=u,_ in OX. 

If ye(WE,(uynG,)~ (WE,(f)UWF(u,)) then a neighborhood of y on the orbit of 
H,, through y is in WF,(u). 


In the proof we may again assume that X CR", ={xelR"; x, 20} and 
that p(x, £)=€4 —r(x, ©). Thus 


P=D?—r(x, D)-Y Ri D,—Ro(x). 


If c is a C™ function then the operator ure‘ P(e’ u) has the same principal 
symbol but R, is replaced by R, -2D,c. We can therefore eliminate the D, 
term and assume in what follows that 


(24.4.2) P=D?—R(x,D’) 


where R is a differential operator in the variables x’=(x,...,x,) With 
principal symbol r(x, &’). We may assume that the coefficients have compact 
support. 

The first step is now to extend the energy identity (24.1.2) by substituting 
for the first order differential operator )' f'd; an operator which is pseudo- 
differential along the boundary but a first order differential operator in x,. 
It is convenient later to have the total order equal to 0 so we set 


(24.4.3) Q(x, D)=Q,(x, D')D, + Qo{x, D), 


where Q, is a pseudo-differential operator in the x’ variables of order —j 
and principal symbol q,. We shall assume that Q is self adjoint, 


(24.4.4)  Q*=Q, that is, Q*=Q, and Q*=Q,—[D,,Q,], 
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which implies that q. and q, are real. Let ueC?(X) and set uj;=Diu. With 
(,)y and (,),y denoting the usual sesquilinear scalar products in X and in 
OX, we shall prove 


Lemma 24.4.2. With P of the form (24.4.2) and Q of the form (24.4.3) satisfying 
(24.4.4), we have if ue CF (IR",) 


1 1 
(24.4.5) 2Im(Pu,Qu)y=) (By (x', Duy, ujax+ >, (Ciy(x, D) uy, u,)y- 
0 0 


Here B,,=Q,, B§,=B,,o=Qo, Bop =Qi(R+R*)/2, for x,=0, the principal 
symbol c,, (of order 1—j—k) of Cy, is real, cy, =C;9 and 
(24.4.6) Deejles, 2) E4°* = {p,q} +2q Im p, 


where 
(X, C= 41S HBO a) —YR,(x)é; 
2 
is the subprincipal symbol of P. 


Proof. First we compute the contributions from the term D?, 


(Dj u, Qu)y—(Qu, Dj u)y)/i= (Dy u, Qu)ox +(Qu, Dy w)ox 
+((Dyu, QD, u)x +(D, u, [D,,Q]u)x— (QD, u, D, u)y —((D,, Q) u, Dy u) x). 


Another integration by parts gives in view of (24.4.4) 
((D, u, QD, u)y—(QD, u, D, u)y)/i= —(Q, Dy u, Dy way. 


The boundary terms add up to those in (24.4.5) with j+k+0. The operator 
[D,,.Q]/i is self adjoint with principal symbol —déq/éx,, so we get the 
contribution 2é, 0q/0x, ={&?,q} to the sum in (24.4.6) from the terms in the 
second sum (24.4.5) now obtained. 

It remains to study 


((Qu, Ru)y —(Ru, Qu),)/i. 


In doing so we set R=R’'+iR” where R' and R” are self adjoint. Both 
operate in the x’ variables and R” is of order 1. Now 
((Qu, R’ u)y ~ (R’ u, Qu)y)fi = (R’ u, 0, U)ox + (LR’, Q] u, u)/i, 
—~ (R” u, Qu), ~~ (Q u, R’ u)y 
= —(Q, R’u,D, u)y —(R" Q, D, u,u)y—((Q5 R" +R" Qo) u, uy: 
which gives the term By,=Q, R’ in (24.2.5) and contributes to the sum in 


(24.4.6) the terms — {r,q}—2qr’, where r” = —Imp, is the principal symbol 
of R”. The proof is complete. 
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Theorem 24.4.1 will be proved by means of an estimate derived from 
(24.4.5). To see what conditions Q ought to satisfy we first observe that if 
u=0 on @X then (B,, u,,u4,)sy is the only boundary term in (24.4.5). In view 
of the sharp Garding inequality (Theorem 18.1.14) it can be estimated above 
by Clu, (, NZ 1 if q,S0. If Y ca (x, €) 4** sO then 


DC jal, 6')Z,.2;0, (Z9,z,)eC’, 


since (c,,) is real and symmetric. Hence the sharp Garding inequality for 
systems (see the remark after Theorem 18.1.14 or Theorem 18.6.14) shows 
that the real part of the second sum in (24.4.5) can be estimated above by 
CY |lujllfo,_p, that is, Cllull?,,-,, by Theorem B.2.3. (In Appendix B the 
half space IR", is defined by x,20 while it is defined by x, 20 here; this 
change of notation should cause no confusion.) Actually we are using a 
slightly modified version of the sharp Garding inequality with different 
degrees assigned to u, and uw, just as in Theorem 19.5.3; it follows from the 
standard version by the proof of that theorem. 

To derive an estimate from (24.4.5) we shall of course have to make 
Yi clx, &)E4** strictly negative in some important region. This will be 
achieved by choosing q so that H,q<0 with strict inequality in some 
important region. However, we shall only be able to do so in the set where 
p(x, €)= €7 —r(x, &)=0, that is, we shall have q decreasing along the (broken) 
bicharacteristic flow. The following lemma will allow us to use this weaker 
positivity condition. 


Lemma 24.4.3. Let Y be an open subset of R’,={yeR’; y,20}, and let 
reC™(Y). We assume that r is real valued, that dr+0 when r=0 and that 
ér/6y, >0 when r=y, =6@r/0y,=0 for j+1. Let 


2 
Alt, y= 4,0) 0 
0 
be a quadratic polynomial in t with coefficients in C~(Y) such that 


(24.4.7) A(t, y)= —W(t,y)? = when t? =r(y), 


where weC™(IR x Y). Then one can find Wy,,,gEC™(Y) with Wo(y)=w(0, y), 
W (y) =O, y)/dt when r(y)=0, and 


(24.4.8) A(t, y)+ ol) +410) 0)" Se)? ry); telR, ye¥. 
Proof. If r(y)>0 we set r(y)=s? and observe that 


YoVt+¥i(yt=Wy) when t=+s 
provided that 


YoY=Ws, y+ ¥(—s, y)/2, WW=W6s,y)—v(—s, y)V/2s. 


In view of Theorem C.4.4 we can choose ¥;eC™(R x Y) such that 


Ws, y+W(—s, Y2=Pls*,y), (Ws, y)—W(—s, y)/25= ¥,(s°, y) 
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in R x ¥. Thus y ,(y)= ¥,(r(y), ye C™(Y) and 
BY y=Alt, y+ Wo +H Ot? -—@W)+ HOY)? -10) 


is linear in t and vanishes when t?=r(y). (We could also have applied the 
Malgrange preparation theorem here.) If we write 


Bit, y)=b,(y)t + oly) 
it follows that b,(y)=0 when r(y)20. We shall have 


. Be y) Sf) (t? -r)) 
if f(vy)=0 when r(y)>0 and f(y)2= F(y) where 


F(y)=[b, OMAr OI? + bol) OI. 


Here F=O(\r|") for any N in any compact subset K of Y. In fact, by the 
implicit function theorem we can for every yeK such that r(y) is negative 
and small enough find Je Y with r(f)=0 and |f—yi<Cir(y). Hence Lemma 
24.4.3 is a consequence of 


Lemma 24.4.4, Let Y and r satisfy the hypotheses in Lemma 24.4.3, and let F 
be a non-negative function vanishing for r>0 such that F =O(r™) for any N on 
any compact subset of Y. Then one can find feC®(Y) such that F<f and f 
=0 when r=0. 


Proof. Local solutions of this problem can be pieced together by a partition 
of unity, so we may work in a compact set K where r<0 at some point. Set 


F,Q=sup{F(x);r(Qjy2—t} if t>0; F,Q=0 if 1s. 
Then F, is an increasing function of t which vanishes of infinite order when 
t=0. Set 
Fi (=) Fi(ts) x(s)ds/s=f F,(s) x(s/t)ds/s 
where y€Cf (1, 2)) is non-negative and {y(s)ds/s=1. Then f,2F, since F, is 
increasing, f, =O when t <0, and 
Of (O=J Fi(s) n(s/t)ds/s=| F,(st) x4(s)ds/s 


where yy=y and y,,,()=—ky,(t)—ty,(QeCe. The right-hand side can be 
estimated by 

F,(2t) f|x,(s)|ds/s=O*") 
so all derivatives of f, tend to 0 when t->0. This proves that f,eEC®,-so f 
= f,(—1r) has the required properties. The proof is complete. 
Lemma 24.4.5. Let A,,¢S'~/-*(R" x R"~'); j,k =0,1; have real homogeneous 


principal symbols a,,(x, ¢') vanishing for large |x|, ag, =4 9, and assume that 


Dal 2) oy**= —W(x, 6)? — when Ef =r(x, €), 
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where weC™(IR" x (R"\0)) is homogeneous of degree 4. Further assume that 
Or/ox,>0 or that dr/d(x',é)+0 in supp aj,. Then one can choose 
B(x, )eS2(R" x R"—') and ¥,(x, é)eS~7(IR" x R"~') with principal symbols 
equal to wW(x,€) and dW(x,O/0€, when &,=r(x,&)=0, such that for any 
ue C%(IR",) and any KER 


(24.4.9) ReY(Ay(x,D’) Dk u, Di wy + | %(x, Dut ¥,(x, DD, ull 
SC(Wulles, ay +HD1 uO, ea HO, Mag +P ail, —1))- 


Proof. Choose C® functions W(x, &') and w,(x, €’) homogeneous of degree 4 
and —% respectively so that w(x, ¢’)=0/ w(x, 6)/0é4 when €, =r(x, €’)=0 and 
so that 


(24.4.10) Wail, 6) OFF * + ol, 6) + Wil 6) 1)? 
Sax, OMEt—r(x, &) 


for some geC® homogeneous of degree —1. This is possible by Lemma 
24.4.3, with y=(x, ¢) and t=¢,; the homogeneity is obtained by taking the 
restriction to the set where |é|=1 and extending by homogeneity. We can 
take y,; and g equal to 0 for large |x|. Choose ¥ and G with principal 
symbols y, and g, and consider 


(24.4.11) Re S (A ,(x, D’) u,,Uj)y + ¥ (P(x, D) F(x, D) u,, uj)x 
— (G(x, D')uy, Uy) +(G(X,DR(x,D) ug Moly. 


The principal symbol is negative semi-definite by (24.4.10), so it follows 
from the sharp Garding inequality that (24.4.11) can be estimated above by 
C(\lUoll,0) + lurll%, 1) aS pointed out above. We take u,=Diu and ob- 
serve that the sum of the last two terms in (244.11) differs from 
(—G(x, D') Pu, u)y by 


(G(x, D’) D? u, u)y —(G(x, D') DD, u, D, u) 
=i(G(0, x’, D’) D, u(0, .), w(O, «))ax (ED, G(x, DY] Du, u)x. 


Since [D,,G(x,D’)] is of order —1, the last term can be estimated by 
C|lull(o,0)!D1 lo, 1), and the boundary term can be estimated by 
C.D, uO, - ie) !4, gg. In view of Theorem B.2.3, the proof is 
complete. 


Remark. We shall sometimes refer to }’a,,é{** as the principal symbol of 
the form (A j,(x, D’) Di u, Di u)y. 


We are now prepared to start the proof of Theorem 24.4.1. We assume 
as before that X CIR”, that P is of the form (24.4.2) with the coefficients in 
CedR".), and that y=(0,,), &)=(0,£). Replacing X by a smaller neigh- 
borhood X¥=X,x[0,c) of 0 we may assume that WF(f) and therefore 
WF(u) contains no element (x,¢) over X,.x(0,c) with ¢'=0. These hy- 
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potheses are not affected if u is multiplied by a cutoff function which is 
equal to 1 near 0, so we assume that ueé’(X). Note that by Theorem 
18.1.36 and Theorem 18.3.32 it follows then that a(x, D')ueC™(X) resp. 
a(x, D')feC*(X) if aeS°(R"xIR"—') is of order —oo in a conic neigh- 
borhood of WF,(u) (resp. WF,(f)) projected to IR" x (R"~'~0). 

Now choose an open conic neighborhood W of (0, &) in X x (R"~' 0) 
such that 


(x, e)EW, x, +0=(x, C)EWF(S): 
(24.4.12) (0, x’, VeW=(x’, )¢WE,(fOWF(uy); 
(x, “)eW= dr(x, ax, >0. 


This is possible since W(f) and WF(u,) are closed and the hypothesis 
yeG, means that r=0, dr/¢x, >0 at y. Assuming that there is some y) +) on 
the bicharacteristic of p through y such that the bicharacteristic interval 
yo, 7] lies over W and »)>¢WF(u), we must prove that (0, €5)¢WE,(u). Let » 
lie after yy on the bicharacteristic; the proof is completely analogous when 
Yo lies after y. Choose an open conic neighborhood [fg of y) in T*(X°)\0 
with I,OWF(u)=6, and let W, be the set of ail (x,é)eW such that if 
r(x,é)20 there are bicharacteristic intervals, possibly broken by one re- 
flection or tangency at @X, which lie over W, have initial point in I and 
end point (x, +r(x, ¢’)*, €). For reasons of continuity W, is an open subset 
of W; it is clear that W, is conic and contains (0, &). We shall prove that 
x(x, D))ueC® if yeS° and cone supp yc W,; by Theorem 18,3.32 this implies 
that (0, €>)€WF,(u). (By cone suppy we mean the smallest closed conic set 
containing supp x.) 


By Theorem 24.2.1 and (24.2.4) we know that 
(244.13) {(x',€)3(0,x,,E)EWy, (x. S)EWE(W)} cG, 
if G, is regarded as a subset of T*(@X q), and that 
{(x, He WF(u); x, >0, (x, EW} 


is contained in the forward H, flowout of G, over W,; thus ¢,>0 there. 
(Recall that €, is strictly increasing on the (broken) bicharacteristics since 
ér/Ox,>0, and that dx,/dt=2¢, there.) In fact, if (x, €) is not in this flowout 
then the definition of W, shows that the bicharacteristic going backwards 
from (x,&) must reach I, after at most a transversal reflection while remain- 
ing over W. 

The proof of Theorem 24.2.1 gives more, 


(24.4.14) x(x, D’)(D, —A_,(x, Due C™(X) 


if yeS° and r>0 in cone suppycW. Here the operator D,—A,(x,D’) in 
the factorization (24.2.6) is defined on the symbo! level when r>0 so the 
product by x(x,D’) is defining up to an operator of order —oo. For the 
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proof we just have to observe that the bicharacteristic going backwards 
from a point (x, —r(x, &’)?, &) with (x, é)econe supp y remains over W until 
it has entered [,, and that r(x, é’)=é?>0 there because &, decreases in the 
backward direction on a bicharacteristic. 

We shall also need a microlocal form of Theorem B.2.9. 


Lemma 24.4.6. Under the hypotheses above, if x(x, D ueH (m,s) for all yeS° 
with cone supp ¥ < W, then x(x, D')ueHy,. 15-1) for all such y. 


Proof. Since 
Di x(x, D')u=x(x, D) f+ R(x, D) x(x, Dut [P, x(x, DY] u 


and [P, x(x, D’)]=7,(x, D') D, + x(x, D’) with x; of order 1 —j and order —a 
outside cone supp xy, we have D7 x(x, D’)u€ Hn _1,s—1)- From Theorem B.2.3 
we now obtain first that D,xz(x,D’)ueH,,,;, and then that 
x(x, Due Hone 1,3—1)° 


Since Ue Hn, 5) for some m, s we conclude by repeated use of Lemma 
24.4.6 that for some s 


(24.4.5), x(x%,D')ueHy 1,  x0,D)D, uly, 0€ Hey (R"~') 


for all yeS° with cone suppycW,. The proof of Theorem 24.4.1 will be 
completed if we show that (24.4.15),=>(24.4.15),.,, for by Lemma 24.4.6 we 
can then conclude that y(x,D’)ueC® for all yeS° with cone suppyc W. 
The decisive step in the induction is the following 


Lemma 24.4.7. Assume that q(x, é)=4,(x, €')€,+4q9(x, &) where the 
coefficients q;e¢C™(IR" x (IR"~'\0)) have support in W, and are homogeneous 
of degree —j, and let q,(0,x', &)= —t(x’, &')? for some teC®. Further assume 
that for some constant M 


(244.16) {p,q}+qMISl|=—wW’+p(f,—1?) and q=v" when p=0, 


where w,veC™(IR" x (R"\0)) have support over W, and are homogeneous of 
degree 4 and 0, while peC™(IR" x (IR"~!~0)) is homogeneous of degree 0 and 
r>0 in supp p, which is contained in W,. If u satisfies (24.4.1), (244.15), and 
M is larger than some number depending on P and on s, it follows that 
T(x’, D')D, ul, 9 €H,.(R"~*) if t is the principal symbol of T, and %(x,D')u 
+ P(x, DD, ucHo, 5 for some YeS*~) with cone supp ¥,< W, and principal 
symbols 6! w(0,x’,0, €)/6 4 when x, =r(x, €)=0. 


Proof. First note that for reasons of continuity the energy identity (24.4.5) is 
valid for all ueH,.,. Choose ye¢C® with cone suppy<W, homogeneous of 
degree 0 for |¢|>1, so that y is then equal to 1 in a neighborhood of suppq, 
for 7=0,1. We shall apply (24.4.5) to 


u,=(1+e7|D1?)" "(1 +1D'?)"? x0, Du, 
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taking Q=Q,(x,D’)D,+Qo(x,D’) where Q,(x, 2)=q,(x, ¢’) when |¢'|>1. By 
(24.4.15), and Lemma 24.4.6 we have u,¢H,3 _,)¢H,., when e>0, and yu, is 
bounded in H,, _,) as e0. The lemma jail follow when e—0 after appro- 
priate estimates have been given for the terms in (24.4.5). 

We start with studying Pu,. Writing 


A,=(1+2)D'P)- (1 41D PY? 


we have 
Pu,=f,+[P,4,x7(%, D’)] u. 


Here f= A, X(% D’)f is bounded in He: o) since x(x, D') feH o, oy and the 
second term is bounded in As. _4) by (24.4.15),. To study Im(Pu,,Qu,) we 
must make a more detailed analysis so we write 


(LP, A, x(x, DY] u, Qu,)=(A,[P, x(x, D’)] u, Qu,) 
+([P,A,] A>" u,,Qu,). 


The first term is bounded as 0 since QF A,[P, x(x, D’)] is of order — oo in 
D'. To deal with the second term we write 


G*(x, D')=[P, A] Ap*=A4,[A7*, PJ=(1 +1D1?)"? [(1 +1D11?)-°?, P] 
+e7(1 +e7|D'[?)7 "(1 + |D'?)°7 []D'?, PI (1 +|D?)- 9°. 
G*(x,&’) is bounded in S! since (e~?+|é?)-1 is bounded in S-? when 
0<e<1. The leading term is purely imaginary. Let 
M +2Im(G*(x, 2) —p(x, ENA +|é)?)"721 

for 0<e<1 and all (x, &), and set 

S*(x, E)=(1 +16'17)*(M +2 Im(G*(x, &) — p*(x, &)) (1 +161?) # 
which is bounded in S$* as 0<e<1. Since the symbol of 

M(1 + |D'|?)? +2G*(x, D’Vi—2 Im p*(x, D’)— S*(x, D')* S*(x, D’) 
is bounded in S°, we have 

2 Im(G*(x, D’)u,, Qu,) = Re((2 Im p*(x, D’)— M(1 + |D'|”)?) u,, Qu,) 
+Re(S*(x, D')u,, QS*(x, D')u,)— C, 


for some C,. Here we have also used that [S*(x, D’),Q] is bounded of order 
—4. With v,=(1+|D'|?)~* S*(x, D’)u, the term involving S* can be written 


Re) (A j,(x, D’') Di v,, Dj v,) 
where 
A,,=0, Ago = (1 +|D'?)* Qo(x, D) (1 + |D')?)*, 


A1o =Apo, =5(1 +|D'|?)* QO, (x, D'\(4 +|D'|)*. 


The principal symbol is |é|v? when p=0, so we can apply Lemma 24.4.5. 
As already observed |u,|(1,1)+[Pu4lio,_1) is bounded as e0. Hence 
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ola, 1+ 1Pe,lo, 1) is bounded and so are the other terms in the right- 
hand side of (24.4.9) when u is replaced by v, and x is large. In fact, v,(0, .) 
is bounded in H,,,). Summing up, we have proved that 


(24.4.17) 2Im(Pu,, Qu,)2Re((2 Im p(x, D'))—M(1+|D'|”)*)u,, Qu,)— Cy. 
Next we shall derive an upper bound for the right-hand side of (24.4.5) 
applied to u, by using Lemma 24.4.5 to estimate 
(24.4.18) Re }\(C4(x, D’) Di u, Dj u) 
+Re((M(1 +|D'|*)? —2 Im p°(x, D’)) u,, Qu,) 
—Re(o(x,D')(D, —A,(x,D))u,, p°(x, D')u,). 
Here C,, comes from Lemma 24.4.2, p(x, &)EC® is equal to p(x, &’) when 
|é|>1, and deS° is equal to 1 in a neighborhood of supp p®, cone supp ¢ 


<W,, and r>0 in cone supp ¢. When p(x, €)=0 the corresponding principal 
symbol defined as in Lemma 24.4.5 is 


{p,q} +qM|é|—p(é,-r)= -—W’. 


Hence the estimates for u, and Pu, already used in the proof of (24.4.17) 
show that (24.4.18) can be estimated above by 


C3—|%(x, Du, + %(x, DD, u,|1z 


where ¥; is related to w as in Lemma 24.4.5. Using (24.4.14) and the fact 
that [6(x, D’))(D, —A,(x,D’)), A, x(x, D’)] is bounded in S*, we conclude that 


(24.4.19) Re($(x, D')(D, —A, (x, D)) u,, p°(x, Du) S C4. 
Summing up, we have therefore proved that 
(24.4.20) || %(x, Du, + ¥(x, D)D, u,||2 
$C,+C,+C,+Re> (By (x', D') Di u,(0, .), Di u,(0, - )Jox- 
Since B,,+ T(x’, D')* T(x’, D’) is of order —2, we have 
Re(B, (x, D)u,1,Uerox + |] Tes lox S Cllu, 10, Niys Cs. 


The other boundary terms in (24.4.5) are bounded when ¢—0 since u,(0, .) is 
even bounded in C™ by the second part of (24.4.12).. Thus 


|| %o(x, D')u, + % (x, D’) D, u,llz+ | Tu,y lox SCes 


and the lemma follows when ¢—0. 


From Lemma 24.4.7 it follows that D,uj,,_9 is in Hi, 4) at (x',¢’) if 
t(x’,é)+0. If using different choices of q we obtain Y%(x,D')u 
+ ¥,(x,D)D, ueH o,,, for two different systems %,'%, of order 3, —5 form- 
ing an elliptic system in cone suppy, where yeS°, then it will follow that 
Ax, DueH oy ..4, 10,D)D,ueHo,, 4 which is a local form of 
(24.4.15),,4. Thus the main point which remains is to construct operators 
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satisfying the conditions in Lemma 24.4.7. This is fairly easy since (24.4.16) 
is only a condition on the restriction of g to the surface p(x, €)=0. At first 
we can therefore ignore the condition that g shall be linear in €,. Before 
giving the construction we prove two technical lemmas which will enable us 
to take the square root t. 


Lemma 24.4.8. If yeC™(IR) is non-negative and 
(24.4.21) [D* x1 Chain x" 


for all multi-indices « and all k<\, it follows that y+¢C™(R%). The set of all y 
satisfying (24.4.21) with fixed C,,,, is convex and contains the origin; the 
product of two such functions has the same property with C,,, replaced by 
22 

Ja], ke 


Proof. In the set where y+0 we can write D*y?* as a linear combination of 
terms of the form 
xy? ID" x... Dx; 
this follows at once by induction. Hence 
|D* x Ss Cage" ae mee 
where k'=1—2j(1—k)>0 if k>1—1/(2/). By Corollary 1.1.2 this proves 
that y*€C®; in fact, estimates of the form (24.4.21) with other constants are 


valid for y*. If y=4,7%,+4,x%, where 7, satisfy (24.4.21) and A, are non- 
negative reals with sum 1, then 


[D* x|< Chala Xi +A, US Cya,a(A1 tit, La)" = Craik x 
by Hélder’s inequality. This proves the convexity, and the last statement 


follows from Leibniz’ formula. 


Lemma 24.4.9. If yeC*(R%) and y=@, W satisfies (24.4.21) then 


(W(t, +lo,to,---,ty)-—W(ty —to,tg,.--,ty)\/2to= b(tG,t1, siege 
where de C™(IRN*}), 


Proof. The difference quotient can be written in the form 


1 
POpussty =) 2G, +3tytiacst ds. 
-1 


The integrand obviously satisfies (24.4.21) for fixed s with no change of the 
constants, for differentiation can only bring out powers of s. In view of the 
convexity proved in Lemma 24.4.8 it follows that F has the same property, 
hence F#eC®. By Corollary C.4.4 this even function of ty can be written in 
the form $(t2,t,,...,ty) with 6¢C°(R%**). The proof is complete. 
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An example of a function satisfying (24.4.21) is 
x7of)=exp(—1/) if t>0; yx,(t)=0 if ts. 
We shall use x. as a cutoff function below. Also x@ satisfies (24.4.21). 


Proof of Theorem 24.4.1. We shall now construct functions q satisfying the 
conditions in Lemma 24.4.7. Let O,y,m)eW, (’. neG,, |n'|=1, and set 


(24.4.22) (x, =i (% C6, + Pol, ¢'), 
(244.23) P,OGE)=INE, bol &)=x7 + Ix’ - YP HIEAS 0? 
Then @ is an increasing function of €, and 
Hb ={07 1.91 $1 + Go} =F1(20b0/0%, — fr, bi) + {E157} O1 — fr bo}. 


In a conic neighborhood W, ,.cW, of (0, y,7') we have by (24.412) for 
some positive constants c and C 


(24.4.24) _ {E,,7} =0r/dx, 24clé'/?, 
{reo Sele, [20po/0x, — ir, OSC. 


Hence H, @>0 when |€,| CSclé'|. ; 
Choose 6 so small that (x, @)eW,,,, if @o(x, ¢')S$36, and consider as a 
preliminary choice of qg 


F =22($0/5) X01 — 4/9) 


where yo is defined above and x2 € Cj°((—3, 3)) is non-negative and equal to 1 
in (—2,2). Later on we shall also need a cutoff function x; € CQ°((—2,2)) 
which is equal to 1 in (—1,1). In supp f we have <6, hence €, Sd|é’|, and 
in supp fnsupp dy,(¢,/d) we have 6,+¢, €,546, 6,226, hence €, < —4|€'. 
This implies r= 67|é'|? if p(x, 2)=0. Thus 


H,f+fMIC\=—w?+p(€,—-r*) when p(x, 6)=0 
where 


(24.4.25) w=x1(6,/51E') N3, 

(24.4.26) —2r? p= —(1—7,(E,/51611)”) N +X0(1 — 6/6) H, x2(G0/6)’, 
evaluated for €, = —r? if r>0, and p=0 if r<0. Here we have written 
(24.4.27) N=x($0/9)"(xo(1 — 4/6) H, $/5 —X0(1 — $/5) M1é')). 


In the support of w we have |, €,|S26 and $),<36, hence 1— — 6/583. 
When t <3 and [10a] <1, it is clear that 


(1 —axo(t)/xo(t))* =( —at?)* 
is a C® function G(a, t). We can write 


N* =42($0/6) (Holl — 6/5) H, $/5)* G(M|€'| 5/H, >, 1-4/9) 
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and conclude for small 6 that weC®. In the support of (1—y?)N we have 
&,<—6|é|, hence r267|Z'|?, so it follows at once that p is also in C® and 
that r26?|? in the support. With v=y,(E,/lé'l)22(¢o/6) xo(1 — 4/8)? we 
have ve C® and v?=f when p=0 if 6 is small enough. 

Now we define q(x, €) as the remainder when f is divided by p using the 
Malgrange preparation theorem, 


F(x, S)= (x, 6) B(x C+ 41(% O51 +40 (x 61). 


Since {p, f}={p,q} when p(x, £)=0 the condition (24.4.16) follows. When 
r(x, €’)>0 we have 


ak, é') = FQ, &1,€') —f@, —€1,€/))/21, if e = rast); 


Since —Of/0E, =73(b0/6)? xo(1 —¢/5) ,/6, it follows from Lemma 24.4.9 
that the right-hand side is equal to —F(&?, x, £’)* where FeC®, so q,(x, &) 
= —F(r,x, &)? when r>0. Without loss of (24.4.16) we may change q, to 
—t? where t=F(r(x, £), x, €’), and have then verified all the hypotheses of 
Lemma 24.4.7. If w, is the principal symbol of % we have at (y’,n’) resp. 
(0, y’,7’) 
t=(x0(1)9,/d)*, Wo =xXo(1) Or/dx, $,/5—Xo(1) M, 
2H W1 = —Xo(1) Or/Ox, G7/6? —xo(1) {r, 13/5 + Xo(1) M4/O. 


Thus ¢ is non-characteristic at (y',7') and w,/) is close to 
—,X9(1)/2x%5(1) 6 at (0, y',7’) if 6 is small. This ratio is different for two 
different small choices of 6. By the discussion preceding Lemma 24.4.8 we 
have therefore proved (24.4.15),,, if the support of x is sufficiently close to 
(1). 

To finish the proof we must establish a similar result at the H, flowout 
of Gz M Wo in Wo. This can be done by using the proof of Theorem 24.2.1 
again. First we choose an operator Q(x, D’) of order 0 with support close to 
the bicharacteristic going out from a point in Gg M Wo and commuting 
approximately with D,—A, there, except when x, is so small that one has 
entered the set already controlled; Q is then cut down to 0. Then 


(D,—A,.(x, D) (Q(x, D)(D,—A_(%, D) ue Ho, 54) 


by the result already proved. From Theorem 23.1.2 it follows therefore that 
Q(x, D')(D, — A_(x, D’ ))ueH (o,s-4)- Logether with the already established 
fact that x(x, D’)(D, -A .(x, D))ueC® when xeS° and r>0 in cone supp x 
cW, this implies by standard elliptic theory that (24.4.15),, , is valid when 
suppz is in a sufficiently small conic neighborhood of a point on I The 
proof of the theorem is now complete. 


Remark, It is clear that the proof yields a more precise result on microlocal 
H,,, regularity. We have omitted it for the sake of brevity. (Cf. Theorem 
26.1.4 for the corresponding interior regularity theorem.) 


24.5. The General Propagation of Singularities 455 
24.5. The General Propagation of Singularities 


We shall begin by proving property (ii) in Lemma 24.3.15 for WF,(u) when u 
is a solution of the Dirichlet problem. Note that (ii) is a consequence of (i) 
in the diffractive set, so we can work in the entire glancing set omitting the 
subset where the gliding vector is zero since the generalized bicharacteristic 
through such a non-diffractive point is trivial by Corollary 24.3.10. 

As in Section 24.4 we assume that P is of the form (24.4.2) with the 
coefficients of R of compact support. Let ue.W”(X) be a solution of the 
boundary problem 


(24.5.1) Pu=f in X NOX, u=uy in OX 
where X is an open subset of IR", and @X =X ndIR",. We may assume that 


u has compact support in X and that (x, )«WF(u), x, >0, implies ¢’+0. Let 
W be an open conic set in X x (IR’~!~0) such that 


(24.5.2) (x, )EeW, x, +0 =(x, QEWF(S), 

(0, x’, C)VeW => (x', C)EWE,(f) UV WF (uo). 
By K we denote a compact subset of {(x’, eG, (0, x’, EJeW} such that 
(24.5.3) H,,(x’, €) is not radial when (x’, é’)eK. 


It will be convenient to assume that |¢’|=1 in K. For every point in K we 
can choose NeC™(IR"~! x (IR"~1~.0)) homogeneous of degree 0 so that H ro N 
=|é’| in a conic neighborhood. Splitting K into a finite number of compact 
sets we may therefore assume that 


(24.5.4) {7>,N}=|é'| in a conic neighborhood of K. 
From Lemma 24.4.6 it follows that for suitable s 
(24.5.5), x(x, D)ueH oo, if ~eS°, cone supp xc W. 


Our purpose is to improve this at a point where there are fewer singularities 
near the backward gliding ray than asserted by condition (ii) in Lemma 
24.3.15; the proof for the forward gliding ray is similar and is left as an 
exercise for the reader. (Recall that the gliding ray is the orbit of —H,, 
where r,(x’, €) =r(0, x’, €’).) In the following statement we use the notation 


WF’ (u) = {(x, €); (*, €)EWR(u) if x,=0 and 
(x,é,,€)eWF(u) forsome €, if x,>0}. 
Proposition 24.5.1. Assume that (24.5.1)-(24.5.4) are valid and that |n'|=1 if 


(y’, "eK, If e>0 it follows that there is some 6,>0 such that if (y’,n)eK and 
for some de€(0, 6,) 


(24.5.6) x, <e6,|(x', £)—(y/, 1) 4H, ,(y', 1) <e6 > (x, 2) WF (u) 
then (y’, 1) ¢WF,(u). 
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Since exp(6H,,)(y', n')=(y',1')+ 6H,,(y’, 1’) +0(67) it follows from (24.5.6) 
that for small enough 6 
(24.5.6Y x, <6, |(x’, ¢’)—exp (6H,,)(y’, 1')| <26/2 
=> (x, C)¢ WF’ (u). 


This form of the condition will be more convenient below. 
The main point in the proof is a slight variation of Lemma 24.4.7. 


Lemma 24.5.2. Let q, v, b, ee C”(IR" x (IR"~!\0)) be homogeneous functions of 
degree 0,0,4,4 respectively, all with support in W, satisfying 


(24.5.7) {p,q} +qM|e'|= —W? —b? +e?, q=v* when p=0. 


Here weC®(IR" x (IR"\0)) is homogeneous of degree 4 and has support over 
W, and M is a constant. Choose B, EeC®™ equal to b and e outside a compact 
set. If (24.5.1}-(24.5.4), (24.5.5), hold and M is larger than some constant 
depending on P and s, it follows that B(x, D')u€Ho,,, if E(x, D')ueH ,,). 


Proof. We follow the proof of Lemma 24.4.7 closely, with q independent of 
€, now. First we define u, as there. The proof of (24.4.17) requires no 
change. Instead of (24.4.18) we consider the form 


Re (Cy (x, D’!) Di u,, Dj u,) + Re ((M(1 +|D'[?)# -2 Im p(x, D’)) u,, Qu,) 
+ || B(x, D’)u,||* — | E(x, Du, |? 


which has the principal symbol —wW(x, €)? when p(x, €)=0. Since u,(0,.) is 
bounded in H,,, for every x we conclude as before that this is bounded 
above as e—0. (We drop the positive contributions which could be obtained 
from w as in Lemma 24.4.7, for the term involving B is now the essential 
one.) The boundary terms are all bounded since Q, =0. A new point is that 


E(x, D')u,=A, x(x, D’) E(x, D))u+ [E(x, D’), A, x(x, Du 
is bounded in LI? as e>0 because E(x, D'\ueHo, 5 by hypothesis and 
[E(x, D’), A, x(x, D’)] is bounded in S*~* as e>0. Thus we have proved that 
|| B(x, D’)u,|| is bounded which by the same argument implies that 
| A, x(x, D’) B(x, D’)ul| is bounded when e—0. Hence (x, D’) B(x, D'ueH oy 
which proves the lemma if y is taken equal to 1 in supp B. 


Proof of Proposition 24.5.1. For every (y’,7')eK we choose a function 
o(x’, &)=w(x’, &’; y’,7') im a conic neighborhood by solving the Cauchy 
problem 

(24.5.8) H,,@=0; a(x’, &)=|x’—y/? H1E1E|—n'? for N(x’, OV=N(y,n). 


By (24.5.4) there is a unique solution when |x’—y|?+|@/|¢|—n'|? is small 
enough. It vanishes of second order on the bicharacteristic of ro through 
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(y’, 7’), hence on the cone it generates, but the Hessian is positive in direc- 
tions transversal to it since this is true at (y’,7’). Since 7, vanishes on the 
bicharacteristic we obtain 


(24.5.9) rol S$ Cwm {E?. 

Similarly we have 

(24.5.10) |0w/0x'|? +|0w/0E?|E/? < Co, 

which is also a consequence of Lemma 7.7.2 since 20. With 
(24.5.11) (x, EV=N(y, n)—N(X’, &)4+-x,/e+ o(x’, &Y/de? 
we obtain 


H, =H, N+2,/e—H, o/6e”. 

By (24.5.8) and (24.5.10) 

H,o=(H,—H,,)@=0(x,)O(/&|@*), 
and on the characteristic set we have 

ET =rSryt+O(x,|61?)=O(w* +x, )[E?. 
When N(x’, é)—N(y,7')S26 and (x, é)S26, we have x,<46e and 
o(x’, €’)<467e?. For small’6 it follows that 
(24.512) H,b={r,N}+O(6/e)*)|E1, if p(x, =0, o(x, £826 | 

and N(x’, &)—N(y’, 7’) S26. 

The first term is larger than |&’|/2 by (24.5.4) if 5 is small so it dominates 


then. 
As in the proof of Theorem 24.4.1 we set 


Ao(t)=exp(—1/t) if t>0; y,(t)=0 if r<0. 


Let xi(t) = 0 for t < 0, x:(¢) = 1 fort > 1 and 0 S x; € CP°((O, 1). We choose 
Xj Satisfying (24.4.21) and conclude that x; does so too. For (y’, 7’) € K, a small 
é,6 <e, and 0 St < 1, we now introduce 


(245.13) q(x, €)=Zo(l +¢—-G/5) x1, (NO, 1) — N(x’, 6) + d)/ed + 0). 


It is clear that g, is homogeneous of degree 0. The support of q, is in the 
interior of that of g; when t<T because x) and y, are increasing. In the 
support of the first factér we have @<6(1+1)<26, and in the support of the 
second one we have N(y’‘,7')—N(x’,€)+d2 —ted. It is equal to 1 when 
Ni, 1) —-N(x’, €)+62(1—d 6 so it constant except in a narrow strip. Set 


(24.5.14)  e,=(xo(1 +t— 9/5) x, ((N(, 1) —N(X, 6) + 0/65 +0) {r, N}/e0)*. 


‘This is a C® function homogeneous of degree 4, for yi and y+ are C™ 
functions and {r,N}>0 by (24.5.4). If (x,é)esuppe, then we have 
IN(y’, 1')— N(x’, C') +6] Sed, hence N(x’, o’)—N(y’,1') S26, so x,S4ed and 
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a(x’, €)) <4? 57. If (y/(6), 1’ (6) = exp (6H,,)(y’, 1’) it follows from (24.5.4) that 
IN(y'(6), 1'(5)) — N(x’, €)| Sed + 0(6"). 

For small enough 6 we obtain 

(24.5.15) xz + |x’ —y'(A)l? +16" In’ (DE —0' (6)? S C? (26)? 


since |N(x’, &’)~N(y'(6), 7’(6))|? + a(x’, ©) is equivalent to the square of the 
distance from (x’, é’/é']) to the ray through (y’(6), 7’(6)). Assuming that 
(24.5.6)’ is valid with ¢ replaced by 2Cé, we conclude that supp e,7 WF’ (u) 
=), hence E,(x, D')ueC™. 

- To apply Lemma 24.5.2 we must also choose b, and w, so that 


(24.5.16) x, (xo(1 +t 4/8) H, 6/5 — MIE xo(1 +t — 4/5) =i + br. 


Here the argument of y, is the same as in (24.5.13). Recalling (24.5.12) we 
set 


(24.5.17) bx, 2) =(x1 oll +t — 9/5) |E'1/26)?, 
(24.5.18) Wel, 2)=x7 (voll +4 — $/5) (Hy d —16'1/2)/5 —xo(d +t — 6/9) MIE)3. 


It is clear that b,¢C®. In the support of g, we have 1+t—¢/6S4, and 
H,¢22\¢ ee in the intersection with the characteristic set by (24.5.12). Since 
Lo(N75(9= s? we obtain for small 6 that y,eC® in a conic neighborhood of 
the characteristic set, for we can first break out the smooth factor 
x¥o(1+t—$/6)*|¢’/5)?. We can multiply w, by a cutoff function with support 
in this neighborhood which is 1 in the characteristic set. 

Having verified all the hypotheses of Lemma 24.5.2 we conclude that 
B,(x, Due H,,, 4). Let 


W,= {(x, €); b, (x, &') > 0}. 


This is a conic neighborhood of suppq, if t’<t, and we have (24.5.5),,, 
when cone suppycW,. For any t<1 and any integer k we can apply this 
argument k times with t replaced by ¢,,...,t, where t<t,<...<t,<1. Then 
we obtain (24.5.5),,,, when cone suppyc W,, so x(x, D’)ueC™ then in view 
of Lemma 24.4.6. By Theorem 18.3.32 it follows that (y’,n')€WF,(u), which 
completes the proof. 


Using Lemma 24,3.15 we can now sum up (24.2.4), Theorems 24.2.1 and 
24.4.1, and Proposition 24.5.1 as follows: 


Theorem 24.5.3. Let P be a second order differential operator with real 
principal symbol p and C® coefficients defined in a C® manifold X with non- 
characteristic boundary 0X. Let uée.N(X) be a solution of the boundary 
problem (245.1) with feNW(X) and u,eG'(eX). Every y eWF,(u) 
~ (WE,(f) U WF (uo)) is then either a characteristic of P in the interior of X or 
else a point in the hyperbolic or the glancing set <T*(0X). An open interval 
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(—T, T)atr+f(t) with #(0)=y,_ on a compressed generalized bicharacteristic is 
contained in WF,(u). 


Note that the last statement is empty if H,(y9) is radial or if HF (yo) is 
radial and y, is non-diffractive. The generalized bicharacteristic is then just 
a ray. Otherwise singularities continue to move along a generalized bichar- 
acteristic as long as it stays in a compact set, does not reach the singulari- 
ties of the data f, uy and does not converge to a singular point of the type 
just described. To show that this result is optimal one should extend 
- Theorem 24.2.3 to generalized bicharacteristics. Only a weaker result is 
known though: 


Theorem 24.5.4. Let P be a second order differential operator with real 
principal symbol p and C™ coefficients defined in a C® manifold X with non- 
characteristic boundary 0X. Let [a, b]3t++¥,(t)} be compressed broken bichar- 
acteristic arcs nowhere tangential to the fibers of T*(X), converging as y->00 
to a compressed generalized bicharacteristic {a, b|att+ f(t) which has injective 
projection to the compressed cosphere bundle and end points ¥(a), (b) over 
X°. Then one can find ueN(X) so that WF,(u) is the cone generated by 
3((a, b]), WE,(Pu) is generated by ¥({a, b}), and u=0 on 0X. 


Proof. It is enough to find uéC(X) vanishing on @X such that WF,(Pu) is 
contained in the cone I’ generated by j(a) and $(b) while WF,(u) is con- 
tained in the cone [ generated by #([a,b]) but not in I’. Indeed, then 
ueN(X) by Corollary 18.3.31, and it follows from Theorem 24.5.3 that 
WF,(u) =I, WF,(Pu) =I". The functional analytic setup in the proof of Theo- 
rem 8.3.8 can still be used here. Thus we denote by ¥ the Fréchet space of 
all u€C,(K) vanishing on 6X with WF,(Pu)cIr' and WF,(u)cr, where K 
<X is a compact neighborhood of the projection of j([a,b]) in X. The 
topology can be defined by semi-norms of the following types: 


(24.5.19) utsup |ul; 
(24.5.20) ut>sup|APu| where Ac ¥~(IR") is properly supported 
and }({a, b}) \WF(A)=9; 
(24.5.21) ut+sup|Au| where Ae P~(R",) is properly supported 
and 9([a, b])\ WF(A)=9; 
(24.5.22) ut+sup|D*Pul| where « is any multi-index and K’ is a compact 
x 
subset of K with the projections of }(a) and 3(b) removed; 


(24.5.23) ut+sup|a(x, D’)ul where aeS~(R"xR"—') vanishes for large |x| 
and is of order —oco in a conic neighborhood of the pro- 
jection of #({a, b]) in R" x R"-1. 


Let K,cK be a closed ball with center at a point x, in the projection of I 
such that K, and the projection of I’ are disjoint. The theorem will be 
proved if we show that there is some ue¥ which is not in C'(K,). If every 
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ueF is in C'(K,) it follows by the closed graph theorem that when |a|=1 
5 ; 
(24.5.24) sup|D*ul SCY N(u), ueF, 
Ko 1 


where N, are semi-norms in ¥ of the type (24.5.18+j). We shall complete 
the proof by showing that (24.5.24) cannot be valid. 

Choose v so large that the conditions imposed on the seminorms of type 
(24.5.20)-(24.5.23) occurring in (24.5.24) remain valid with 7 replaced by j,. 
We shall apply (24.5.24) to functions U, constructed as in the proof of 
Theorem 24.2.3 for ¥,, but with a factor e~* inserted in (24.2.10). (We may 
assume that a*(x,A) has support in K for every 4.) This new factor is 
bounded in S° as e-0. The proof of Theorem 24.2.3 therefore yields uni- 
form bounds for N,(U,) as e0 if 2<jS5. It is also obvious that sup|U,| is 
uniformly bounded. Assuming that x, is only the projection of a single 
point in 9,([a,b]) we obtain from the discussion of D,(U*—U_) in the 
proof of Theorem 24.2.3 (preceding the statement) that for some « with |c| 
=1 we have |D*U,(x))| > 0 as ¢-0. Thus (24.5.24) cannot hold, and the 
theorem is proved. 


Not every generalized bicharacteristic is a limit of broken bicharacteris- 
tics. Indeed, if (y’,7')EG, and r,=0 in a neighborhood, then there can be no 
reflection point nearby. Since r(x, €)>0 when x,>0 in a neighborhood of 
(0, y’,7’) it follows that €,+0 on a bicharacteristic so x, cannot have a 
minimum other than 0 on a bicharacteristic. In Section 24.6 we shall prove 
by a special argument that nevertheless there is a solution which is singular 
on a given bicharacteristic of this type. It is not known if there are other 
generalized bicharacteristics which are not limits of broken bicharacteristics 
and give rise to a gap between Theorems 24.5.3 and 24.5.4. However, in 
Example 24.3.11 it is clear that both the polygonal and the gliding bicharac- 
teristic constructed through the point in G™ are limits of broken bicharac- 
teristics. Hence there is no certain way of predicting the future path of a 
singularity in general. 


24.6. Operators Microlocally of Tricomi’s Type 


In this section we shall discuss situations where the results of Section 24.5 
are inadequate. Both are related to the Tricomi operator 
Di —x,(D3+...+D?) 


in the hyperbolic or elliptic half spaces defined respectively by x, >0 and 
x,<0. In the first case Theorem 24.5.4 is not applicable since all the 
bicharacteristics have cusps when x, =O (see Fig. 6). In the second case the 
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> % 


Fig. 6 
Propagation Theorem 24.5.3 is empty but we shall see that there is hy- 
poellipticity. We shall discuss this situation first. 


Definition 24.6.1. The hypoelliptic set HE< T*(¢X)~0 is the intersection of 
the gliding set and the complement of the closure of the hyperbolic set. 


If X as usual is defined by x,20 and the principal symbol p of P is of 
the form €{—r(x, €’) then 


(24.6.1) HE = {(x’, €); ér(0, x’, E)/ex, <0, r(0, yn) <0 

in a neighborhood of (x’, &)}. 
The terminology is justified by the following analogue of (24.2.4) 
Theorem 24.6.2. Let u, fe W(X) and Pu=f, u=u, on 0X. Then 
(24.6.2) HEd WE, (u) = HE 0 (WE,(f) VU WE (uy). 


The proof of the estimates leading to Theorem 24.6.2 will require the 
following lemma: 


Lemma 24.6.3. If ve C% (IR ,) and a>0 then 
(24.6.3) iol] SO x40] MEF yy yaar 
where || || is the L? norm. 


Proof. Let y(x)=1 for x<1, y(x)=2—x for 1Sx<2 and y(x)=0 for x22, 
and set w=yv. Then ||v—w| S||x70]| and 
wl Six'ol + [lxv'l| Sl]x* oll + he'd. 
Since es at 
|w|? =f |wi? dt= —2Re J tww' dt S4||wI| Iw] 
0 0 : 
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it follows that 
lw] S4 lw] S4 [xt] +4 lle’, loll S5 [x70] +4 [lo 
Now replace v by v(./t) where t>0. Then we obtain 
|p| $5 |x*o] +4 |e" /t. 
When t is chosen so that ||x7vj| t**+= ||v’|], the estimate (24.6.3) follows. 


An immediate consequence of (24.6.3) is 
(24.6.4) flvllo,y S94 SSP ID, vle5?, ve CPR"), 
where s,e€R and s=(sy+as,)/(a+1). In fact, if V is the partial Fourier 
transform of v with respect to x’=(x,,...,x,) then 
IV (., EV Slt VO, EQ PD VG, ENver? 
by (24.6.3). If we square, multiply by 
(LIE P= (141 PCL LE AND 


and integrate with respect to ¢’, then (24.6.4) follows by Hélder’s inequality. 
Similarly we obtain 


(24.6.5) 12, Dey S 2lellosolDPr vilosy, $= Go +s1)/2, 
if we start from the estimate 


|v(0)|2= —2Re f vd’dt<2|ol| |v’, ve CRAR,). 
0 


Lemma 24.6.4. Let p(x, €)=€7 —r(x, ¢’) where reS? and 
(24.6.6) r(x, VS —ex,|f'?, 0<x, <1, 


for some c>0. If ueCX(IR",) and x, <1 in supp y, it follows that 
2 


(24.6.7) Y/Di~ullo, 243) S CUP D) ulleo, 0) + Hullo, 0) + 4, -IIhcsyay): 
1) 


Proof. An integration by parts gives 
(p(x, D) u, Dj u)= ||Dj ull? —(r(x, D) Dy u, D, u) 
+([r(x, D’), D,J uy, D, u)—(r(0, x’, D’) u, 0, u), 

where the last scalar product is in L? on the boundary plane defined by 
x,=0. Since it follows from (24.6.5) that ||@, u(0,.)\|,,, can be estimated in 
terms of the left-hand side of (24.6.7), we estimate this term by 
u(0, -)licg, 12, uO, -)Il4). We have [r(x, D’), D,]= C(x, D’) where C is of second 
order and Re C is of first order. Thus 


(C(x, D')u, D, u)+(D,u, C(x, D’)u)=({D,, C(x, D’)]u, u) 
+((C(x, D')+ C(x, D’)*) D, u, u)—i(C(O, x’, D’)u, u) 


where the last scalar product is in the boundary. Since [D,, C] is of second order 
and C+ C™ is of first order we obtain when taking real parts 
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(24.6.8) \|D? ull? —2 Re(r(x, DD, u, D, u) 
S |lp(x, D) ull? + C((l4llo, 1+ 1D1 4lo,0)) lio, 1 
+ |u(0, Jig) |D;, 40, DI). 


Now we use the hypothesis (24.6.6). By the sharp Garding inequality 
(Theorem 18.1.14) applied for fixed x, we have 


c|x7D, ull, 1) —Re(r(x, D')D, u,D,u)+C|D, ulléo, 4) 
Hence 
(24.6.9) Dull? +cl|xFD, ull, 1) 
S||p(x, D) ull? + C(llullo, 1) + 1D 1 ull, 3+ 4 lg Ds uO, ly 
Now (24.6.4) shows that we can estimate ||D, ull2 (0,3) by means of the left- 


hand side, and (24.6.5) allows us to estimate iD, u(0,.)IIa, by || D? ul 
+||D, ull(o,2)- Hence it follows with a new constant C that 


(24.6.10) Dj ull? + .D, wll, aS C(x, D) ull? + lull, 1) 
+||D, ul]? + hw, le»). 


To estimate the term with j=2 in the left-hand side of (24.6.7) we shail 
use the equation r(x, D')u=D?u—p(x, D)u. The sharp Garding inequality 
gives in view of (24.6.6) if we now regard x, as a parameter 


|| xf ull) <SRe(—r(x, D’)u, |D'|? u) + C jul é). 
If we multiply by x,, integrate with respect to x, also, and use the inequality 
ab <4(ca? +b?/c), it follows that 

ella ullo, 2= S iIirte, D Wu llO,0) +2C|lx} ullo, 2)" 


By Cauchy-Schwarz’ inequality 
xf ullfo, HE Sx, Ull(o, auto, 1) 
so we have with another constant C 


1X1 ull(o, 2S Cllr, DY ull, 0) + llull(o, 1): 


Since r(x, D’)u=D{u— p(x, D)u it follows that we may add |x, ul|(, 2) on the 
left-hand side of (24.6.10), with an appropriate change of C. Since by (24.6.4) 


{lull fo, ySC(lD, ullio, atx; ullé, 2) 


we have finally proved that 
Set. i12 < 2 2 2 2 
y Dy #llo,23/3) S CUlp@, Dyull” + ello + Pr ¥ll” + lle, dilegy)- 


Now |Jull(o,1)Sllull%,o)lull’, gy» and |D, ul? <2)Dzul| |lull by the proof of 
Lemma 17.5.2. The middle terms on the right-hand side can thus be re- 
placed by a small constant times the left-hand side plus a large constant 
times |ju/[?. This completes the proof of Lemma 24.6.4. 
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For any seR we have more generally 


2 
(24.6.7) DIDI ~ulho,ss 24/3) C(\lp(x, D)ullo,y+ Il#Ilco,s) + }u(O, lis+4))- 


In fact, if we apply (24.6.7) to (1+|D'|”)"? u and use that 
[p(x, D), (1 +1D'?)"7] =[ —r(, D), (1 +1D'?)°"7) 
is of order s+1, we obtain (24.6.7) with an additional term ||ul[(o,,,1) in the 
right-hand side. Since 
lUllo.s+ eS Ullto, 544) ll4llfo, ») 


the estimate (24.6.7) follows. 


Proof of Theorem 24.6.2. If (xo, &)€HE then the principal symbol p(x, £)= €7 
—r(x, &) of P satisfies (24.6.6) in a neighborhood of (0, xo, &). Assume for a 
moment that (24.6.6) is valid when 0<x,<1, with reS? now, and that 
UEC”, _ — 
ueHi? .)(R*), PueHoy(R’). 

Let ¢eCV(R"—'), [¢dx'=1, and let A,=@(eD’) be convolution with 
$(x'/e)e*~" in the x’ variables. Then A,u has support in a fixed compact set 
for small e, and A,ue Heo? (IR",) so (24.6.7) can be applied to u for reasons 
of continuity. We have p(x, D)ue He") IR“) and 


p(x, D)(A,u) =A, p(x, D)u—([r(x, D), A,ju 


where the commutator is uniformly bounded in Op S'. Hence p(x, D)(A,u) 
is uniformly bounded in Ho, and it follows from (24.6.7! when e—0 that 
ueHos"? ,), which improves the hypothesis made on u. 

The arguments used to prove Lemma 20.1.13 and Theorem 20.1.14 can 
now be copied with no change to complete the proof of the theorem. We 
leave the repetition as an exercise for the reader. 


Theorem 24.5.3 shows that singularities always propagate from diffrac- 
tive points. Theorem 24.5.4 gives a solution singular on the bicharacteristic 
through (0,x’,é)eG, if (x’,é) is in the closure of the hyperbolic or the 
elliptic set. In fact, if (y’,7')EE is close to (x’, é’) then we can find a small 
y, >0 so that r(y, 7 =0, for ér(0, x’, €)/0x, >0 and r(O, y’, 7')<0. The bichar- 
acteristic through (y, 0, y’) lies in the half space x, 2y, for a fixed amount of 
time and converges to the bicharacteristic through (0,x’,0,é) when 
0,7) > &’, €) (cf. Lemma 24.3.4). 

However, when (x’, €’) is not in the closure of HUE, that is, r(0, y’', 7) =0 
in a neighborhood of (x, &), then all bicharacteristics passing close to 
(0, x’, 0, €’) turn with a cusp at the plane x,=0, as pointed out in the 
beginning of the section. We shall now show that nevertheless there is a 
solution which is singular precisely on a given bicharacteristic. 


Theorem 24.6.5. If (y’, nyeGynfE then one can find ueC(X) so that u=0 on 
OX, PueC®™.in a neighborhood of (0, y’), and WF,(u) is precisely generated 
there by the compressed bicharacteristic through (y’, 17). 
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Proof. We may assume that X CR” and that P has the usual form (24.4.2). 
For the principal symbol r we have by hypothesis r(0,x’,é}20 and 
Or (0, x’, €)/Ox, >0 in a neighborhood of (y’, 7’). Hence 

r(x, €)2ex,|é'? 
in a conic neighborhood of (0, y’,7’). Choose beS*? equal to R in a conic 
neighborhood of (0, y’, 7’) so that b satisfies (23.4.1). Set B=b(x, D’). 

By Lemma 23.4.7 we can then find uweH,,,_,(IR",) so that 
(D+ —B(x,D’))u=0 when x,<4 and u=0, D,u=@ when x,=0, where 
¢H,,_,(R"~') is given. In fact, we just have to choose some uy€H,. ,_ 1) 
with these Cauchy data (Theorem B.1.9) and can then find u—u eH, 4-1) 
<C° with Cauchy data 0 such that 

(Dj —B)(u — uo) =(B—D?) Up He »_ 1) 
If we choose ¢ with WF(@) generated by (y’, 7’) it follows from (the proof of) 
Theorem 23.4.8 and the interior propagation theorem that WF(u) for 
0<x,<4 is contained in the cone I generated by the bicharacteristic 
through (0, y’,0,7’). Hence PueC® in a neighborhood of (0, y’). The wave 
front set of u must there be equal to I, for Theorem 24.5.3 would otherwise 
show that (j’, n')€ WF,(u), hence that (y’, 7')€ WF(@). The proof is complete. 


Remark. For the Tricomi equation itself an explicit construction by means 
of Airy functions is easily made. 


24.7. Operators Depending on Parameters 


In this section we shall complete the arguments of Section 17.5 by proving 
Lemma 17.5.14. To do so we must take another look at Theorem 24.5.3 for 
operators depending on parameters. First we rephrase Theorem 24.5.3 as an 
estimate. 


Proposition 24.7.1. Let P and X be as in Theorem 24.5.3 and let I, I, T, be 
open cones in T*(X)\O such that every compressed generalized bicharacteris- 
tic <I which intersects [, must also intersect I, if maximally extended in I: 
Let B be a Banach space containing C°°(X) as a dense subset such that B is 
continuously embedded in Q(X) and the restriction to @X extends from 
C%(X) to a continuous map B>Q'(0X). For every compactly supported 
A,€%°(X) of order —0o outside a compactly generated cone CI, it is then 
possible to find compactly supported A, A,¢¥,°(X) of order —c outside 
compactly generated cones <T resp. <I, such that 


(24.7.1) A, ull S| APull+|A,u] + Cllull, 
if ueC%(X) and u=0 on @X. (|| || denotes L? norms.) 


Proof. The space 
F ={ueB; TOWF,(Pu)=0, u=0 on 0X, LZ AWF,(u)=9} 
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is a Fréchet space with the topology defined by jjul[, and the semi-norms 
|APul| and ||A,u|| with A and A, as in the proposition. By hypothesis and 
Theorem 24.5.3 we have I, 0 WF,(u)=@ if ue¥ Hence the map 


FaurrA,ueLl(X) 


is everywhere defined, and it is closed since it is continuous in the 9’ 
topology for A,u. Hence the map is continuous by the closed graph theo- 
rem, which proves the proposition. 


We shall now allow P to depend on a parameter z in a neighborhood Z 
of 0 in IR".. We assume that the coefficients are in C°(X x Z) and that P, 
satisfies the hypotheses of Proposition 24.7.1. We shall first prove the 
stability of this condition. 


Lemma 24.7.2. Assume that the hypotheses in Proposition 24.7.1 are fulfilled 
by P,. For every compact set K,<I, one can then choose compact sets K , 
<I, and KCTF and a neighborhood Z,<Z of 0 such that on every maximally 
extended compressed generalized bicharacteristic of P. intersecting K, there is 
an interval <K with one end point in K, and one in K,, provided that zéZ. 


Proof. Choose increasing sequences of compact sets K} <I, and K’cJ, each 
in the interior of the following one and with union I, and I respectively. If 
the statement is false we can find a sequence z,eZ converging to 0 and 
compressed generalized bicharacteristics [,stt> B,(t) of BR, such that 

(i) I, is a neighborhood of 0 on R and B,(O)eK,; 

(i) K3nB,U,)=0; 

(ii) BL) cK”; 

(iv) If tel, nel, then B,(t)hedK’; 

(v) If tef I, oI, then B, is maximally extended at t. 

We shall derive a contradiction by studying the limit as v— oo. 

When studying 8,(t) for t20 we distinguish two cases: 

a) B,U,OR,)cK* for a fixed uw and infinitely many indices v. After 
passage to a subsequence the right end point a, of I, has a limit a>0 as 
v>oo and B(t)=lim B,(4) exists if O<t<a, by the uniform Lipschitz con- 
dition observed after Definition 24.3.7. The proof of Proposition 24.3.12 
shows that f(t) is a compressed generalized bicharacteristic of P,. We have 
BO)EK, by @, BEL, when 0<t<a by (ii), and we claim that f(t) cannot 
be extended beyond a. If this is not true then the limit B(a—0) exists, H,, is 
non-radial there, and if B(a—0) is a glancing non-diffractive point then the 
gliding vector is not 0 there. But then it follows from Corollary 24.3.14 that 
B,(s) can be extended to sSa+e for some e>0 and all large v, which is a 
contradiction. 

b) Assume now that for every p we have 6K*nB,(,0R,)+9 for large 
y. For fixed p we have a uniform Lipschitz condition for the restriction of B, 
to the component of 0 in {tel, OR, ; B,(t)}¢K"}. Passing to a subsequence 
we thus obtain a limit B(t) defined for O<t<a such that B(O)EK,, BDEL, if 
0 <t <a and B([0,a))N OK" # O for all y. Hence G is maximally extended at 
a, as a compressed generalized bicharacteristic in I. 
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If we argue in the same way for t<0 we obtain a compressed general- 
ized bicharacteristic of R, intersecting K, but not J, which is maximally 
extended in I. This is a contradiction proving the Jemma. 


The analytical statement in Proposition 24.7.1 also has an analogue 
involving parameters: 


Proposition 24.7.1’. Let P have C® coefficients in X x Z and assume that P, 
satisfies the hypotheses in Proposition 24.7.1. For every I5GI, one can find 
5>0 such that if A,e%°(X) is compactly supported and of order — co outside 
I, then one can find A, A,€'%°(X) of order —o outside compactly generated 
cones <I and I, respectively, such that (24.7.1) is valid with P replaced by P. 
when zeZ and |z| <6. 


The proof of Proposition 24.7.1 was obtained by invoking the closed 
graph theorem to change the qualitative statement in Theorem 24.5.3 to a 
quantitative one. However, we could equally well have inspected all the 
steps in the proof of Theorem 24.5.3 to prove the estimate which would 
then have been given a more precise form. Moreover, a proof of that kind 
remains valid for z so small that Lemma 24.7.2 is applicable. We leave for 
the reader to satisfy himself that this is true and turn to our application of 
Proposition 24.7.1’, the proof of Lemma 17.5.14. 

As a preparation we shall give an explicit interpretation of the geometric 
condition in Proposition 24.7.1 in the simplest case. Thus we consider the 
constant coefficient wave operator 67/dt?—A in {(t,x)eIR"*';x,20} and 
determine the generalized bicharacteristic flow backward in time from 
points in the forward light cone {(t, x); t>|x|,x, 20}. This is identical to the 
bicharacteristic flow backward in time in IR"*! from the full open forward 
light cone, that is, the union W of the conormal bundles of the backward 
light cones with vertex in this set. We claim that if —|x|<t<jx| then 
(t,x, t, €)e W if and only if 


(24.7.2) z(<x,  +t7)>0. 


For the proof we first observe that (t, x,t, is in the conormal bundle of 
the light cone with vertex at (to, x9) if 


(t —t9)? =|x—x9l’, t(X —Xo) +(t fo) E=0 
(cf. (17.4.11)). This implies |z]|=|é] +0. The condition can be written 
(tX+t6)/tp —C =TXo/to 
and since x, is any point with [x,|<tf,) in IR” it is equivalent to |y—€|<|t| 


for some t,>0 if y=(tx+t)/tp. Since |¢|=|t] this is true if and only if 
«tx +té, €> >0, which is condition (24.7.2). 


Proof of Lemma 17.5.14. We shall establish the required estimates for z close 
to a point z)¢K,y where we may assume that P| = —A. We shall therefore 
first consider solutions of the wave equation 


(62/6t27—A)v=0 in Q={(t,x); |t|<3, x, 20} 
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which vanish when x, =0 and in 
Q, _ {(t, x); |x| > \t| +4, |e] < 3, xy 20}. 


By Lemma 17.5.12 the last condition follows if |y|<4 when (0, y) is in the 
support of the Cauchy data. (The same is true for solutions of (67/dt? + P) v, 
=0 satisfying (i), (ij) in Lemma 17.5.14 if |z—z | is small enough.) The 
Green’s kernel with pole at such a point is only singular in the union of the 
cones {(t, x); t?=|x —y|?} when |y|<4, so it is C® in 

Q,={(t,x); t>|x|+4, <3, x, 20}. 
Let Tj=T*(Q)~0 for j=1,2, and let I be the union of I, the complement 


of the compressed characteristic set ¥ in T*(Q,)\0 and the compressed 
generalized bicharacteristic flow backwards in time over Q from I. Here 
Q, = {(t, x); t>4—|xl, {e| <3, x4 =0}. 
It is clear that !'<T*(Q,) and that the intersection with ¥ is defined over 
2,;\Q, by (24.7.2) with t replaced by t—4. Hence I is an open cone. Note 
that 2,<Q, and that Q,0 {(t, x); tS$0}<Q,. The hypotheses of Proposition 
24.7.1 are thus fulfilled by the wave operator and I, i, I,; we may also 
replace [, by I. 
Let MEQ, be a compact neighborhood of 
My={(t, x); 1StS2, |x| $3}. 
Our problem is to estimate the derivatives of the functions v, in Lemma 
17.5.14 uniformly in M. Let K, be the unit sphere bundle in T* restricted to 
M, and choose, using Lemma 24.7.2, a compact neighborhood Z)<Z of Z 
and open cones 
mel, GeEeft,x»ehst<—-1}, el, 


such that K, Jj and the hypotheses of Proposition 24.7.1 are fulfilled for 
these cones and any P, zeZ,y. Fix z arbitrarily in Z) now, and let K{ be the 
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union of K, and the compressed generalized bicharacteristics for P back- 
ward in time from K, to the plane t= —1. Let KZ be the intersection with 
that plane. We can then choose open conic neighborhoods I) of Kj such 
that 'arMsrs...> K* 

and the hypotheses in Proposition 24.7.1 are fulfilled for P with I; replaced 
by F°* and I replaced by F™ when v=1,2,.... In fact, when I has 
already been chosen it follows from Lemma 24.7.2 that the compressed 
generalized bicharacteristics going backwards from a sufficiently small 
neighborhood of K{ must be contained in r™ until t= —1—e for a suitably 
small ¢. 

From Proposition 24.7.1' we obtain if € is sufficiently close to z that 
there is a uniform bound for ||Av,|| for any A e%"(Q) which is com- 
pactly supported and of order — oo outside a compactly generated cone 
<I) the bound depends on A‘ of course. Differentiation of the equation 
(67/ét? + R)v,=0 with respect to ¢, gives 


(67/61? +R) 60,/6C;= —(6F/0;) v= f 


Since we already control A™f we now obtain from Proposition 24.7.1’ a 
bound for ||A® dv,/d¢,|| when ¢ is in some possibly smaller neighborhood of 
z, independent of Ae), and A® is of order —oo outside a compactly 
generated cone <I, Continuing in this way we obtain a bound for 
|A™ Dtv,|| if A is of order —oo outside a compactly generated cone cr 
and ¢ is sufficiently close to z, |«|<v. In particular we obtain uniform 
bounds for any derivative Df, -v,(t,x) when (t,x)eM and ¢ is in a neigh- 
borhood of zeZ, depending on «. By the Borel-Lebesgue lemma this im- 
plies a uniform bound for all z€Z, which completes the proof. 


Remark. Note that over Q, there are many singularities for the Green’s 
functions. It is only thanks to the microlocal point of view that we have 
been able to avoid difficulties from them. That is why it was not possible to 
prove Lemma 17.5.14 in Section 17.5. 


Notes 


The energy integral method was already used by Krzyzanski-Schauder [1] 
to derive estimates for the mixed Dirichlet-Cauchy problem for second order 
operators. Much later Garding [6] determined the general first order 
boundary conditions which can be used instead of the Dirichlet condition to 
derive fairly weak estimates. For higher order operators energy estimates 
were first given by Agmon [6]. In addition to what has later become known 
as the uniform Lopatinski condition he required that the real roots of the 
characteristic polynomial in the normal direction should be at most double. 
This condition was removed by Sakamoto [2]; similar results were proved 
for systems by Kreiss [1]. (See also Chazarain-Piriou [1].) For a discussion 
of correctness in I? we also refer to Sakamoto [3]. 
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We have restricted ourselves here to the mixed Dirichlet-Cauchy prob- 
lem for second order operators in order to be able to cover the boundary 
regularity theory fully in that case. As seen in Section 24.2 the discussions of 
the elliptic and the hyperbolic regions are essentially microlocal versions of 
elliptic boundary problems and strictly hyperbolic Cauchy problems. (See 
also Chazarain [1] for the hyperbolic case.) However, the glancing case poses 
essential new problems. These were first overcome in the diffractive case by 
Melrose [3] and Taylor [2] using parametrix constructions. (See also Eskin 
{2].) The work of Melrose was inspired by the explicit solution given by 
Friedlander [1] for the wave equation in a layered medium; his example is 
the model operator in Theorem 21.4.12 here. Taylor on the other hand was 
led by the asymptotic expansions of geometrical optics as given by Ludwig 
[2]. The gliding case was then treated by Andersson-Melrose [1] using the 
equivalence of glancing hypersurfaces proved by Melrose [2]. Another proof 
based on a parametrix construction was given by ESskin [2]. Later on 
Melrose and Sjéstrand [1] gave the geometrical discussion of generalized 
bicharacteristics which we have followed here in Section 24.3 with minor 
modification. They proved Theorem 24.5.3 using only energy integral argu- 
ments independent of the Melrose equivalence theorem except in the diffrac- 
tive case. For that case Ivrii [2] derived the propagation theorem by energy 
integral arguments, thus eliminating the need for the parametrix construc- 
tions of Melrose and Taylor. We have followed his paper to the extent that 
it has been possible to do so. A new result may be the observation that in 
the diffractive set it is not necessary to assume that the gliding vector does 
not vanish. For results on more general boundary conditions the reader 
should consult Melrose-Sjéstrand [1] and Eskin [3, 4]. 

The interesting Example 24.3.11 has been taken from Taylor [2]. Fur- 
ther information on the non-uniqueness of generalized bicharacteristics at 
G® can be found in Melrose-Sjéstrand [1]. For real analytic boundaries this 
problem does not occur. However, if one is interested in analytic singulari- 
ties then singularity propagation along the gliding ray occurs also in the 
diffractive case, so singularities will in general spread from bicharacteristics 
to sets of higher dimension. (See Friedlander-Melrose [1] for a special case 
and Sjéstrand [4] for general results.) 

The construction of solutions with given singularities in Sections 24.2 
and 24.5 is a modification of the constructions in the interior case which 
seems to have been given first in Ralston [1]. The simplification given here 
by means of positive Lagrangians has been used by many authors for 
similar problems concerning interior singularities (see e.g. H6rmander [29)). 
In the present context it has been used recently by Ralston [2]. 

The results in Section 24.7 are taken from the simplification by Melrose 
[7] of the work of Ivrii [3] on the asymptotic properties of the eigenvalues 
of the Dirichlet problem. Melrose also used the idea to derive the qualita- 
tive part of Theorem 17.5.5 and applied it to similar questions for systems 
as well. 


Appendix B. Some Spaces of Distributions 


The purpose of this appendix is to collect the definitions and basic proper- 
ties of the spaces of distributions used in the text. To a large extent they are 
special cases of spaces discussed in Sections 10.1 and 14.1. However, we shall 
give an essentially self contained exposition since this requires only minor 
repetitions. 


B.1. Distributions in IR" and in an Open Manifold 


Already in Section 7.9 we introduced the important Sobolev space H_,(IR") 
consisting of all ueY'(R") with de L,, and 


(B.1.1) Il) 5) =(2m)~” J ACSI? +11) do) < 00. 


When s is a non-negative integer then H,, consists of all u such that 
D*ueL?(IR"), |al<s. (Using the pseudo-differential operators introduced in 
Chapter XVIII one can define H,.,. for any s as the set of distributions 
mapped to I? by all operators of order s.) Thus H,,) is the inverse Fourier 
transform of L?(IR",(1+|é|?)§dé/(2z)"). In Section 14.1 we introduced modifi- 
cations of these I? spaces. We shall now consider their inverse Fourier 
transforms which are modifications of H,,) spaces. As in (14.1.2) we set 


(B.1.2) X,={GeIR";R,;_,<|&|<R,}, where Ry=0, R,;=2/~* ifj>0. 
Recall that by (14.1.8) 

(B.1.3) R2/4<(1+|é)?)<2R?,  EeXp 

For any ue/’ with tel’, 


nju(x)=(2n)- fb (dE, 


we set 


that is, we multiply the Fourier transform by the characteristic function of 
X ;. Then it follows from (B.1.3) that 


(B.1.4) 27" Iz;4MloyS \|n july Rp oS!" ||; 410), 


and we have 
lullQ=d lz;4llo- 
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This suggests the following definition of more general Besov spaces: 


Definition B.1.1. If 1 Sp<0o and seR, then ?H,, is the Banach space of all 
ue’ with deL2,. and the norm 


loc 


(B.1.5) Pull =(L Ianjulle)” 
finite. (This should be read as sup, ||z,ul|,,) if p= 00.) 

It is clear that ?H,, is a Banach space and that 7H 
have 
(B.1.6) 2-H Pull S(O LR ty llfoy)? $2"! ? Hl 


which describes ?H_,,. and its norm in terms of the P? norm || Il(o) Only. Since 
’ <I4 if p<q it follows at once that ?H_,, increases with p. We have 


(B.1.7) °H. cH, if s,>5p. 


() =H. By (B.1.4) we 


In fact, 
al gy SAY RP wy ull SZ HY RE-H Illy 


Summing up, we have found 


Pionsenior B12. "H,,.,<”H,,., if (and only if) s,>s, or s,=s, and 
PySP2- 


If p<oo then 
N 
PS x,u—uly70, Noo, if ue?Hy. 
i 


Since Cf is dense in P. it follows that C? is dense in the Fourier 


comp 


transform of ?H,,. Hence / is dense in ?H,,, which implies that C> is dense 


in ?H,,. because C> is dense in & If 1/p+1/p’=1 we have 
Iu, =O S? |ull_g)? ells ue? Ay, veCP; 


which proves that ? H (~s) is continuously embedded in the antidual space of 
_?H,,. From the converse of Holder’s inequality it follows that the embed- 
ding is an isomorphism: 


Proposition B.1.3. If 1 Sp<o then the antidual of ?H,, is i: Oa where 1/p 
+1/p’=1. 


Example. The spaces B and B* of (14.1.1) and (14.1.3) are the Fourier 
transforms of 'H (» and ”H,_4) respectively. 


We shall now discuss how ’H,,. can be obtained as an interpolation 
space which is a convenient way of deriving various estimates. (See also 
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Lemmas 14.1.3 and 14.1.4.) If we?H,, and u,=7,u then we know that u 
=) u, with convergence in #’ and that 

[lUglly Ri, 2"'M,, teRR, 
where M,,= ||, ull (9), thus 

(Yi (Ri M,)?)"/? S24! lull ,,). 
The following proposition gives a converse. 
Proposition B.1.4. Assume that u,€H,..H(.,), K=1,2,... and that 
(B.1.8) ugllisy SRM, = v=1,2; k=1,2,.... 


If sy<s<s, and RiM,el?, then )'u, converges in ¥', the sum belongs to 
PH), and 

(B.1.9) FY ually S Cls)(L (REM PY”. 

If p<0o the series converges in ?H,,, norm. 


Proof. It suffices to prove (B.1.9) when the sum is finite and then apply it to 
the difference between partial sums with s replaced by a smaller number if p 
= oo. Then 2,u=)'7,u,, and (B.1.8) gives 


RS” 5 Ug (oy SC llugllsy SRE My 
With v=0 when k2>j and v=1 when k<j we obtain 
Rj Ir, ulloy SD a, Ri My, 
a,=RpP-SRS-° if k2j;  ay=RE RS if k<j. 
Since R, is a geometric progression we have 


Yan), Yay,sCs) if Cis)=(1—2%-%)-'+(24-—1)7?. 
j k 
The estimate (B.1.9) follows now from a classical lemma of Schur: 


Lemma B.1.5. If })|a,,|<C and }'\a;,|<C for all k and j respectively, then 
ri k 
(SIM aj xl?) SCL x)?; = 1S poo, xel?. 
j ok 


Proof. The statement is obvious if p=1 or p=oo. Otherwise we obtain if 1/p 
+1/p’=1 using Hélder’s inequality 


I»; a. X,|? S (X |aj,\)?"” s la jl XPS Cry. aj, |X,l?. 
k k k k 


Hence : 
¥ y Aix xP < Citplp XY \x,l? =C? > \x,{? 
jok 


which completes the proof. 
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Corollary B.1.6. If T is a continuous linear map H,,,) > H(.,_ m=) which restricts 
toa continuous linear map Fey > Hes m for some s,>S, then T restricts to 
a continuous linear map.?H..—>?H,_ my if Sy<S<s, and 1<pSo. 


Proof. If ue?H,, then Tu=)'Tx,u with convergence in H 
=0,1 we have 


, and for v 


(so-m 
Tx, Ulles, — my S CUMUM yp SCO RY | Ull (oy, 


(SRE, ullfoy)/? S2'P luli. 


The statement follows now from Proposition B.1.4. 


Theorem B.1.7. If 6€C®” and D*@ is bounded for every a, thenut> duisa 
continuous map in ?H_, for all s and p. 


Proof. The continuity in H,, is obvious if s is a non-negative integer. Since 
the adjoint of the map H,_,3ur-gueH,_, is the mapH,,surrgueH.,, if s 
is a negative integer, the continuity in H. follows also in that case. Hence 
the theorem is a consequence of Corollary B. 1.6. 


Theorem B.1.8. Let w be a diffeomorphism of an open set X,<IR" onto 
another open set X,cIR", and let yeEC>(X,). Then the maa 
W*(xyvje€(X ,)c ER") is in ?H,,(IR") if ve? HR"), and we have 


PIW*(X VII SCS) lvl) 
Proof. By Corollary B.1.6 it suffices to prove that 
I(x MDI SCO) [Pll 


if s is an integer. When s is a positive integer this follows immediately by 
differentiating the definition 


w*(xv)=x) op) 


and taking w(x) as a new integration variable. If s is a negative integer we 
shall argue by duality. Choose 7,eC7(X ,) equal to 1 near supp y* y. Then 
we have for ve C7 (IR") 


Y*(x 0), w)l =" (x0), x1 wl =l(idet >") 1x0, "FO WIS Clos lell_w 
by Theorem B.1.7 and the first part of the proof. Hence 


IWF My SC lols, 


which proves the theorem. 


Remark. For the spaces Hy="He, arbitrary seIR, one can also prove 
Theorem B.1.8 by means of (7.9.3), (7.9.4). 
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By Theorem B.1.7 we can use Definition 10.1.18 to define PHS (X) if X 
is an open subset of IR"; this is the set of all uwe@’(X) such that 
pue?H,,(IR") for every peCy(X). More generally, if X is a C” manifold a 
distribution ue9'(X) is said to be in "H(S(X) if 


(x7 ")tu=u,e? Hey (X,) 


for every C” coordinate system x: X, > X,<IR". The topology is defined by 
the semi-norms 
Ur? ||P Uxll¢s) 


where @ is an arbitrary element in C®(X,) and x is any C® coordinate 
system. From Theorem B.1.8 it follows as in the proof of Theorem 6.3.4 that 
it suffices to take x in an atlas for X and d=, provided that the sets 
{xEX,; b,(k x)+0} cover X. In particular, if X CR" and KcX is compact, 
then PHO (X )né'(K)<?H,,(IR") and the topology is that defined by ?| ||, 

We shall now discuss restrictions to submanifolds starting with the 
restriction to a hyperplane in IR". If we (R") we denote by yu the restric- 
tion to the hyperplane x, =0, identified with R"~': 


(yu)(x')=u(x’,0), x’eIR"= 1. 
Theorem B.1.9. y extends to a continuous map from ‘H,,)(IR") to H,o)(IR"~*) 
=[?(IR"~'), which we shall also denote by y. For every s>0 the restriction of 


y maps ’H,,, ,(IR") continuously onto ?H,,(IR"~') and has a continuous right 
inverse from ?H,.(IR"~*) to ?H(,,1)(IR"). 


Proof. With the notation ,u in (B.1.4) we have n;ueC® and 
lly mj ulloy=(2m) "J a(S) dg, /2nl? dg’ 
geX; 
S(2n)7*-" 2R; j jag)? dé=n-* R; |; Ul). 
xj 
If deC> then u=))72,u with only finitely many terms +0, and the triangle 
inequality gives 
Iv ulloysd lly T;UlloyS Cc ‘lull ay: 


Such functions are dense in *H|,, which proves the first statement and that 
yu=) y7,u for every ue'H,,). 
If ue? H,,, ,,(IR") we have for t20 


4 ly 2; Ully S2' Ri lly 24h (0), 
an 
[Rp Yr UlloyS |R5** 2, ulloy 


where the right hand side is in /?. Hence Proposition B.1.4 applied to partial 


sums gives if s>0 
Py uly SCS) lulls). 
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Thus y is continuous ?H,,, ,(IR")>”’A,,(IR"~ '). To construct a right inverse 
we choose weC> (IR) with y(0)=1 and set for ve?H,,(IR"—*) 


Uy (x) = 0, (%') ¥ (Ry x,) 


where x’=(x,,...,X,_;) and v,=2,v is defined as before but with IR” 
replaced by R"~*. Then 


(6) =8,(E)W(E,/Ry/Re- 
Vall y= (20)-” FLEE? WOE, / RR? +1? de 
SCREW * SEW? WE MPL + SR dE 


<C, Ri’~ ' || 7, vif): 
This gives (B.1.8) with M,= C? Re * I t, Plo) Since 
Rt M,=C/ Ry Il, Pll oye? 
if ve? H_,,,, it follows from Proposition B.1.4 that the map 


v> Vu, 


is continuous from ?H_,, to ’H,,, 4). It is obviously a right inverse of y when 
s>0Q. The proof is complete. 


Remark. In Chapter XIV we proved that y is also surjective from 'H @(R”) 
to H,o(R"~ ') but the right inverse above is not continuous from Ho (IR"~') 
to 'H,,(R’). 


In Theorem B.1.9 we have just discussed the restriction to a hyperplane. 
However, iterating the result k times we obtain that there is a continuous 
restriction map from *Hy,.)(IR") to Hjo)(R"~*) if x'eIR"~* is identified with 
(x’,O)eIR". It restricts to a continuous map from ?H,, , ,,.,(IR") to ?H,.(R"~") 
for every s>0 and integer k>0. 

Since we have defined the restriction mapy by continuous extension 
from the smooth case it is clear that it is invariant under a change of 
variables. If Y is a C® submanifold of codimension k of a C® manifold X 
and u € PHIS(X ) we can therefore define the restriction yy u of u to Y ifs > k/2, 
and yyu €? Hi, . then. 

The attentive reader may have noticed that the right inverse constructed 
in the proof of Theorem B.1.9 was defined for all s although we could not 
assert that it was a right inverse of y if s<O since y is not defined in ”H,,, 5, 
then. This suggests that y should be defined in some larger spaces. We shall 
now define such modifications of the H,,, spaces. (For the sake of brevity we 
no longer consider analogues of the Besov spaces.) In the definition the 
plane x,=0 plays a distinguished role; we note that the first n—1 coor- 
dinates €’=(€,,...,€,_,) of €elR” are coordinates in its dual space. 
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Definition B.1.10. If m,seIR we denote by H,,,,(IR") the set of all vue’(IR") 
with del? and 


loc 
(B.1.10) Ul on, sy = 2m)” J HACE\P CA + 1E|7Y"(1 + 1E'1?)° d 6) < 00. 
This is the special case of the spaces B, , of Definition 10.1.6 with 
k(E)=(1 + 16)7"70. +1279. 
Thus multiplication defines a continuous bilinear map 
S X Ay, )3(0, rg ue H 
by Theorem 10.1.15, and Theorem 10.1.8 gives 
(B.1.11) H, (R<— Hen, 55 


Co (R") is a dense subset of H,,, for all m,s. If m is a positive integer then 
A m,.) consists of all u with Di ueH,o,,4 mj for OSj<m, and veH,,,. means 
that v can be regarded as an I? function of x, with values in H,,(IR"~ '). 

As in Theorem B.1.9 we denote by y the restriction map 


(yu) (x')=u(x’,0), ueS(R"). 


(m, s) 


Gs (R")<>m,<Sm, and m,+s,Sm,+5,. 


Theorem B.1.11. If m>4 there exists a continuous extension of y to a 
map Hy, .(IR")> Hm +3-4)(R"~?). There is also an extension map e from 
UH,)(R"~') to ¥'(IR") such that e is continuous from H,.(R"~*) to 
Hm,s—m+4)UR") for all s, m, and ye is the identity. Moreover, e can be chosen 
so that yD‘ e=0 for every positive integer k, and xke is a continuous map 
from H,(IR"~*) to Hoy, 5+%—m+3)UR") for all s, m. 


The last statement has a corollary which is analogous to Corollary 1.3.4: 


Corollary B.1.12. If m is a positive integer and f,EH(.,m—,~—3)(R"~ 1) k 
=0,...,m—1, then one can find uéH,,, )(R") with y Dku=f,, k=0,...,.m—1. 


Proof. We just have to take 


m—1 


u= ) xkef/k!. 
0 
Proof of Theorem B.1.11. lf ueSY and v=yu then 
[B(E)? =|f AO) ae,/2n|? Sf la(OP7 + EP" aE, J + [EPI den (2m)? 
SCH I aCSIP CL + LEP?" +1 PF dE, /2 0. 


The last inequality follows by taking t = €,/(1+|¢’|?)? as a new variable instead 
of &,. Here it is essential for the convergence that m>4. If we multiply by 
(1+ |é’7)s*™-+ and integrate it follows that 


NOU (s4 m— 4) S Cou ll tll on, sy 
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To construct the extension operator e we argue essentially as in the 
proof of Theorem B.1.9 taking weC? equal to 1 in a neighborhood of 0. 
With A=(1+ £7)? we define ev=u for ve¥ so that 


,(€', X,) = W(Ax,) 0(¢'). 


Here uv, denotes the Fourier transform with respect to the x’ variables. It is 
then clear that ueY, that u(.,0)=v and that Diu=0 for x,=0 if j+0. We 
have 


\|Di xh Ullo,s+k—j+4) =(22)' mal (a(é? APETKE-DFT Ge J [Di xh WA x)? dX, 
=(2n)'~" | (EV? Ade [IDE xi w(x,)I? dx, 
Ifmisa positive integer we obtain by adding for OSj<m 


k 
Xn UllOns+k—m+4) = Crk Pll 


which completes the proof. 


B.2. Distributions in a Half Space 
and in a Manifold with Boundary 


We shall denote by IR", the open half space of IR" defined by x, >0 and by R" 
the complement of its closure R".. If F is a space of distributions in IR" we 
shall use the notation F FIR", ) for ‘the space of restrictions to IR". of elements 
in F and we shall write F(IR",) for the set of distributions in F ‘supported by 
R",.. Thus F(R") is a subspace of Y’(IR",) and a quotient space F /F(IR") of 
F; it contains the quotient of F(IR",) by the subspace of elements with support 
in OR", = {xeR"; x, =0}. The notation is meant as a reminder of these facts. 


Example. Ce(R" ) consists of the restrictions to IR", of functions in C,°(R"). 
It follows from Theorem 1.2.6 that it can be identified with the space of 
functions in C°(IR") vanishing outside a compact set. The space C#(IR",) is 

the subspace of functions vanishing of infinite order when x,=0. The dual 
space of C@(IR",) is by Theorem 2.3.3 equal to the space gi", ) of distri- 
butions with support in IR", and that of Cr (iR", ) is the space D’(IR",) of 
extendible distributions, by the Hahn-Banach theorem and Theorem 2.3.3. 


We shall mainly discuss the spaces H,,,,(R",.) and H,,,(R",), with the 
quotient and induced topology respectively. Thus we set 


Il cn, sy = UF || U Il on, 59> uéH,,, y(R",), 


with the infimum taken over all UeH,,, ,(IR") equal to u in R’,. 
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one B21. Co(IR*.) is dense in_ Hm, AR"), Cx(IR",) is dense in 
Hm,)(R",), and the spaces H,, )(IR".), H(_m,—9(R".) are dual with respect to 
an extension of the sequilinear form 


fuddx; ueCP(R’,), veCF(R"). 


a Since C@ is dense in H,,, »(R") it is clear that C7(R") is dense in 
Aa, QR"). That CR") is dense in H,,(R’, ) follows from Theorems 
10.1.16 and 10.1.17, where we take @ with support in R’,. Now H a (iR". ) 
is a closed subspace of H,,,,(IR") so its dual space is the quotient of 
Fi(_m,~s)UR") by the annihilator of the dense subset Cy (IR’,), that is, the 
subspace of elements vanishing in R",. That is by definition H,_,,,_,(IR".). 


it follows from Theorem 10.1.8 that 
(B.2.1)  Hon,.sR") © Hims.s) QR.) m,Sm, and m,+s,Sm,+5,. 


Using Theorem 10.1.10 we obtain 


Theorem B.2.2. Let K be a compact subset of IR".. Then 
{u; u€H@, .,0R")s {UU as s1) = 1, supp uc K} 


is a compact subset of A ma,s>)(R".) if m,<m, and m,+s,<m,+s,. If K has 
interior points, these conditions are also necessary. 


Proof. The necessity follows immediately from Theorem 10.1.10 if we shrink 
K to a compact subset of IR”. To prove the sufficiency we take a sequence 
WEA m,.5 CR") with suppu,cK and |juyllgn,.s)<1. Then we can find 
U,€Hm,,s,(R") so that U,=u, in R* and |U,|lan,,s, $2. Choose S€ CP (R") 
so that 6=1 in a neighborhood of K and set U;=@®U,. Then we have 


| US llent.sy = Cl Uy ll mi,s1) S2C 


where C is a constant. Since supp U,csupp® for every v, it follows from 
oe 10.1.10 that there exists a subsequence U,, converging in 

FH my,s,)R") to a limit Ue H m;, ;)(R"). But then the sequence u,, of the 
restrictions to IR", converges in H,,,, ,)(IR",) to the restriction of U to R’,. 
The proof is comiplete. 


We shall now prove some results which lead to a more direct description 
of H,,,,)(R".) when m is a non-negative integer. 


Theorem B.2.3. In order that Ue Hp, GR") it is necessary and sufficient that 
UC Hn t,s+1)UR",) and that D nUEH oy 1,90R",); we have 


(B.2.2) 3 5 llullen. j= SD, Ulloa 1, ot tulle Lst)= Sllullen, s)* 
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Moreover, uc H,,,,,(R",) if and only if uc H,,,,..10R",) and D, jUEH y, >—1)CR",) 


when j <n, and we have : 
ne 


(B.2.3) itl bn, s) = It Gas — at Y || Dj ull on s— 1)° 
i 


Proof. Let ue H,,,,(IR".) and choose UeH 
I U liom, s) = Ul mn, s)* Then 


Dy UI m— a.) VU lent. 1) = VU Mons) = le lle ss 


which proves that D eH oy 1,sUR".), that ue Hay, 1,s+1)0R".) and that the 
second inequality in (B. 2.2) is valid. The rest of the theorem follows at once 
from 


(IR”) so that U=u in R" and 


(m, s) 


Theorem B.2.4. Set & =(€,,...,€, 1), <2 =(L4|€/?)* and 

(B.2.4) An, (6) = (<6 +76,)" <o>*. 

Then A,, (D) 6 =F *(A,, > deS(R"), is an isomorphism of F(R",) which 
exten by continuity to an isomorphism PAR", )o Ay m, sR"). The adjoint 


ms(D) extends by continuity from C(R",) to an isomorphism from 
Hm, Fm, R",) to PR". ). Thus 


(B.2.5) 4m, s) = |A,,, s(D) Ull raartys ucH,, (IR), 
and Hn, )R") consists of all ueF'(IR",) such that A, uEL?(R",). 


Proof. By Example 7.1.17 the inverse Fourier transform of A,,, (resp. A,,,) 
has support in IR", (resp. IR”). Hence A,,,(D) is a continuous map in 
F(R") with inverse A —m,—s(D), which proves the first statement since 
An, (OP = +1E7y"(1 +12’ \)5, The second statement follows by duality. 


Remark. Writing A,,,=(<D'> +iD,) A,_1,, we conclude from Theorem B.2.4 
that every ueH_ m,—)(R",) can be written u=ugt+D,u, where 
Ug H gy _ ~m, —5= GR") and u ne yy. _-m, -5\(IR".). We shall occasionally “need this 
dual version of Theorem B.2.3. 


Corollary B.2.5. If m and s are non-negative integers, then H. om, UR") consists 
of all ue? (R") such that D*ueL?(IR",) when |alSm+s and «,<m. For the 
norm we have the estimate 

B26) Cy ful2,y5 YL DtulZe<ilull2,4,  “€ Hin, 9 (RD). 


alsmt+s 
a Sm 


Here C, is a positive constant depending on s and m but not on u. 


The proof follows immediately from Theorem B.2.3 by induction with respect 
to m and the obvious fact that Ho,0)(IR”) = L?(R4). In the same way we obtain 
for any m and s 


(B.2.7) ||D“u|\(m—x4,s+a9—lo) S = lt Ilan, s) ifue Hon,sy(R%). 
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Since the elements in H,o,(IR") can be considered as I’ functions of x, with 
values in H,,(IR"~') it is clear that Hy ,(R".)=L(R,,H,(R"~ ')). Hence 
we also obtain from Theorem B.2.3: 


Corollary B.2.6. If m is a non-negative integer the space Hm,s)(IR".) consists of 
all ueY'(R",) such that Diu is an LP function of x,eR, with values in 
Ho 4m—) UR” ') when OSj<m, and |\u|| qq, is equivalent to 


4 


(5. (1DLu( dem) 
j=0 0 


The following theorem is an immediate consequence of Theorem B.1.11. 


Theorem B.2.7. If j is a non-negative integer and m>j+4, then the mapping 
Ce(IR",)aur+Di u(., x,) 


can for fixed x,20 be extended continuously to a mapping from Him, (R".) 
into H,4m—j—4)(R"~'). For every u€ Hm, (IR",) the map 


R,5x,-+Diu(.,x,)eH (IR"~') 


(s+m—j—4) 


thus defined is continuous, and 


ele || Di u(. ? xi Gimaiaay= Cwm, j Ul om,sy> ucH , .(IR",)- 
Since H,,(IR"~')< C’\(R"~') if s>(n—1)/2+v (Corollary 7.9.4), we obtain 


Theorem B.2.8. If ueH,,,)(IR",) it follows that D*u is continuous in IR", if 
lal|<m+s—n/2 and a,<m—3. 


In the study of boundary problems for differential operators Theorem 
B.2.7 combined with the following theorem related to Theorem 4.4.8’ often 
allows one to give a precise interpretation of the boundary conditions. Let 
X be an open subset of R", and let X°= XR"... Since UE Am, )(R",) and 
peCy(R") implies @ueH,,,(R",) we can as in Theorem 10.1.19 introduce 
the local space Hie (X °) consisting of all uweQD'(X°) such that 


pueH,, 9(IR",) for all pe Cy (X). 


Theorem B.2.9, Let P be a differential operator of order , 
P= ¥' a,(x)D* 


ja] Sp 


where a,¢C*(X) and the coefficient of D® is equal to 1. If 
ucH ,(X°) and PueHie (X°) 


(m1,81 (ma — #82) 


it follows that ueH °° .(X°) if mgm, and m+sSm,+s,, j=1,2. 
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Proof. Choose PEC F(X) and set v=@u. Assume first that mSm,+1. Then 
we have 


(B.2.8) Pve Heyy, (R%)s 

for b PucH in, - ” 5) Lom —p.s) by (B.2.1) since mm, and m+s<m,+s,, and 
Duc A yn sy CHR ys if lal<p since mgm,+1 and m+s<m,+s,. 
Also note that 

(B.2.9) D* VE H in, — cops 51 — la’) 


since v € A(m,,s,) This implies D vé Hm —p,s) if a, <p and |a| Sp, so it follows 
from (B.2.8) that Dév € Honus ) since this is true for all other terms in Pv. 
We claim that D/ J ve HG, _ hs)? O<j<up. Since this is already proved when j 
=p we may assume that j<p and that Di*'veH,,,_;1,). Now our claim 
follows from Theorem B.2.3 for Di v€ Hm, — js, <Hm—j—1,5+1)- When j=0 
we obtain veH,,, ,, that is, ue Her >. 80 the theorem is proved provided that 
msm,+1. To prove it in general we may as well assume that m,+s,=m, 
+s, and that m,;=m,-—k for some integer k>1. Then we have shown that 
ue Her + 1,s,;-1) and repeating this argument we obtain 


ueH ee Five 


Tyloc _ 
(m+ 2,81 — ayo WEA Gn 4k s4—) = (m2, s2)° 


The proof is complete. 


Remark. In the theorem we could equally well allow u to have values in ©” 
provided that the coefficients are N x N matrices and the coefficient of D* is 
invertible. After multiplication by the inverse of this matrix the proof is 
unchanged. 


The notion of C® manifold with boundary is defined by allowing the sets 
X,, in Definition 6.3.1 to be open subsets of the closed half space IR”,. Since 
R" is diffeomorphic to the open half space IR" it is clear that every 
manifold is also a manifold with boundary. Furthermore, if X is a manifold 
of dimension n with boundary, the set X° of all points xeX for which 
«xeIR", for some (and hence for every) coordinate system « such that 
xeX,, is a manifold of dimension n called the interior of X. The set of all 
xeX such that xx belongs to the boundary of IR". for some (and hence for 
every) « with xeX, forms a manifold of dimension n—1, called the bound- 
ary of X, and is denoted by 0X. (The coordinate systems in X° and 0X are 
of course the restrictions of those in X.) 

If X and Y are open subsets of R" and y: Y>X is a C® diffeomor- 
phism, then we can use Theorem 1.2.6 to extend wy to a C® map: Y>X 
between neighborhoods of Y and X in R". It is a diffeomorphism if Y is 
chosen small enough and X=y-(Y¥). If ueM(X) and suppucX then 
pt ueD'(Y) and suppW*ucY. There is an obvious identification between 
the set of such distributions for different choices of X; we denote it by 
BX). It follows from Theorem 2.3.3 that Y* defines a mapy*: 
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Q(X)>Q@(¥Y) independent of the choice of X,Y. Similarly we 
define the set 2’(X°) as the set of restrictions to X° of distributions in some 
neighborhood of X in IR”. The pullback y* maps BX) to 9 (Y°) since an 
extension to Y is provided by the pullback by ¢ of an extension to X. 

For any C® manifold X with boundary we can now define the space 
9" (X°) of extendible distributions in X and the space 9’(X) of distributions 
supported by X by an obvious modification of Definition 6.3.3. In view of 
Theorem B.1.8 we can also define the spaces AS (Xx "oY" (X°) and Hey (X ) 
<Q'(X) just as in the case of open manifolds. 

The notion of differential operator in a C® manifold X with boundary is 
defined just as in Section 6.4 for the case without boundary. If the order is 
Sm it is clear that it defines a continuous map from H(g(X°) to Hie »(X°) 
for every s. A boundary problem in X involves, besides a differential 
operator in X, a certain number of boundary conditions on the boundary 
0X. Each of these is defined by a boundary differential operator, that is, a 
linear map p of C”(X) into C®(éX) such that for every C®” coordinate 
system « we have 


(pujox-t=p*(uck7') in X,AIR" if ueC?(X). 
Here we have used the notation Rj={xelR"; x,=0}, and p* denotes a 


differential operator 
p(x, D) =)! a(x) D* 


with coefficients in C°(R30X,). Besides the order of p,. which was in- 
variantly defined in Section 6.4, we shall also consider the transversal order 
of p, which is defined as the smallest t such that af=0 for every x when 
%, >t. Equivalently, it is the smallest ¢ such that pu=O in X for every u 
vanishing of order t+1 on 0X. (The transversal order, like the order, may 
be +00 if 0X is not compact.) Note that a boundary differential operator of 
transversal order 0 can also be considered as a differential operator in the 
manifold dX. 

Now assume that p is a boundary differential operator with C® coef- 
ficients and of order p<oo. If the transversal order is <m—4, it follows 
from Theorem B.2.7 that the mapping 


C®(X,)aursp" ve C® (R20 X,) 
has a unique extension to a continuous mapping of Fes (Xe °) into 
Hm - pURgnX, ); we denote the extension also by p*. If ue Him (X°) it is 


clear that the distributions p"u, in R%0X, define a distribution in the 
manifold 0X. We denote this distribution by pu and obtain 


Theorem B.2.10. A boundary differential operator p with C@ coefficients and 
of order y defines a continuous linear mapping 


HG5(X°)aurspueH,_ y(0X) 


if the transversal order of p is <m—4. 
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As in Section 64 there is no difficulty in extending the preceding 
discussion to distributions with values in vector bundles and differential 
operators between them. We shall make use of such extensions freely with 
no further comments. 


Appendix C. Some Tools from 
Differential Geometry 


In this appendix we have collected some facts on maps and vector fields 
which are required in the main text. For most of the topics a more 
thorough discussion can be found in many texts on differential geometry. 


C.1. The Frobenius Theorem and Foliations 


The basic existence theorem for first order systems of differential equations 
may be stated as follows: If 


v=¥ v (x) 6/0x; 
1 


is a C® vector field in a neighborhood of 0 in R" with v(0)+0 then one can 
introduce new local coordinates y,,...,y, such that v=0é/dy, in a neigh- 
borhood of 0. We shall now discuss a similar result where we have several 
given vector fields. Recall that the commutator of two vector fields v and w 
is defined by 

fv, wlu=v(wu)—w(eu); ueC®, 


This means if w=)’ w,(x) 0/0x, that 


[v, w] = )"(v, dw,/0x, —w, dv ,;/0x,) 0/0x;. 


Theorem C.1.1 (Frobenius). Let v,,...,v, be C® vector fields in a neigh- 
borhood of 0 in R". If 


(C.1.1) v,(0),...,v,(0) are linearly independent, 
(C.1.2) [v2 =D Cin 
then there exist new local coordinates y,,..., y, Such that 


6/dy;,=)b,,0;, i=1,...,7 
1 


where b,, is an invertible matrix. 
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Proof. First we note that Vi=)a,;0,, i=1,...,r, also have the properties 
(C.1.1), (C.1.2) if det (a;;) +0, for 


Ly Gin Vy, Y ay v;] =y ax An lY,, v,] +¥a;,(0, a,j) vy =>} av; Aix) Vy. 


The conditions are also invariant under a change of variables. (This proves 
already that (C.1.1), (C.1.2) are necessary for the conclusion to be true.) 
Assume now that Theorem C.1.1 is already proved for smaller values of n, 
which is legitimate since it is trivial when n=1. Changing variables we may 
then as well assume that v, =0/0x,, and after subtraction of a multiple of v, 
we may also assume that 


vj=)b,0/0x,  j=2,...,7. 
ait 


When x,=0 these vector fields in R"~* satisfy our hypotheses so changing 
the coordinates x,,...,x, we may by the inductive hypothesis assume that 
b, =0 for j=2,...,r and 1>r when x, =0. Now 


rr. 
0b y/0X, =v, vj X,=[04, 0] = Di Cr je PeX 
k=2 
3 
=P) Crp dys 1H 2,00, 05 JD o05F. 


k=2 


The uniqueness of the Cauchy problem for first order ordinary systems of 
differential equations now gives that b,=0 if />r, in a neighborhood of 0, 
and this completes the proof. 


Corollary C.1.2. Assume that v,,...,v, satisfy the hypotheses of Theorem 
C.1.1, and let f,,...,f,€C®. Then the equations 
(C.1.3) vju= 


have a C® solution in a neighborhood of 0 if and only if 


yp j=l. 


(C.1.4) vf, -0f= DY Cpfs bIzLr 
k=1 


If S is a C® manifold of codimension k with tangent plane at 0 supplementary 
to the plane spanned by 0,,...,v, then there is a unique solution in a neigh- 
borhood of 0 with prescribed restriction uyeC®(S) to S. 


Proof. The conditions (C.1.4) follow immediately from (C.1.3) and (C.1.2), 
and they are obviously invariant if we change variables or make linear 
combinations of the equations. By Theorem C.1.1 we may therefore assume 
that uj = 0/Ox;,j =1,...,r. Then cj = 0 and (C.1.4) reduce to 


(C.1.4) af,/ox,—ef/Ax,=0;  ij=1,..,7. 


If x'=(x,,...,x,) and x” =(x,,,,...,x,) then S is defined by x’=h(x”) in a 
neighborhood of 0, and (C.1.3) means that for fixed x” the differential of u is 
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the form ¥° f,dx, in x’, which is closed by (C.1.4)’. Thus 
1 # 
u(x)= JD fpdxjtug(h(x”), x”) 
h(x’) 1 : 
is the unique solution of (C.1.3) which is equal to up on S. 


The coordinates y in Theorem C.1.1 are of course not uniquely de- 
termined. If ,,..., 9, are other coordinates which enjoy the same properties, 
then 


d/Oy;= Yi O9,/6y, 0/05, 
k=1 


must be a linear combination of 0/0),, ..., 6/05, when j<r. Thus 09,/0y,=0 
when j<r and k>r, that is, (j,,,,--.,§,) is (locally) a function of 
(Yy419-++> ¥,)- This suggests the following global notion: 


Definition C.1.3. A C® manifold X is said to be foliated of dimension r if it 
has a distinguished atlas F such that for the map 


G=K ok 1: K(X, NX,)OK(X,0X,) 
the components ¢,,1,.-., @,(x) are functions of x,,,,...,x, only if x, neF 


Here we have kept the notation in Definition 6.3.1. A foliation in X 
defines for every xeX a plane 2, T;,(X) of dimension r by the equations 
dx,,,=...=dx,=0. By Theorem C.1.1 the additional property of a foliation 
is that, in a neighborhood of any point, 2, has a basis v(x), ...,v,(x) such 
that v,,...,v, are C® vector fields satisfying the Frobenius conditions 
(C.1.2). (To see that this is sufficient one should note that every atlas has a 
refinement for which an arbitrary intersection of coordinate patches is 
connected.) For every x,¢X there is locally a unique r dimensional man- 
ifold 2<X, called the leaf of the foliation through xo, such that T,(2/) = 7, 
for every xe€X. With local coordinates in the distinguished atlas, X is defined 
by x,,,=constant,..., x,=constant. Alternatively, it is the set of points 
which can be reached from x, by moving along integral curves of the vector 
fields v,,...,v, inside a suitable neighborhood of x,. The set of all points 
which can be reached along such curves is the global leaf of the foliation. 
With any point it contains the local leaf through it but in any neighborhood 
there may be a countable number of other leaves corresponding to integral 
curves which are nearly closed. For further details we refer the reader to 
Haefliger [1]. 


C.2. A Singular Differential Equation 


Let v be a real C® vector field defined in a neighborhood of 0 in IR". We 
assume given a splitting x=(x’,x”) of the coordinates in IR", where x’ 
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=(x,,...,x,) and x” =(x,,1,-..,X,), and that v=v,g-+w where 
(C.2.1) U9 =<Ax’, 6/6x’D, 
(C.2.2) WONG ORs w,=0, dw,/ax=0 when x'=0. 


Thus v is singular when x’=0 and only then if the real kxk matrix A is 
non-singular and |x’| is sufficiently small. 


Theorem C.2.1. Let v=v,g+w be a C® vector field in a neighborhood of 0 in 
IR" such that vy and w satisfy (C.2.1) and (C.2.2), and Red>0 if A is an 
eigenvalue of A. Let g, h be C” functions with g(0)>0. Then the equation 


(C.2.3) gu+vu=h 
has a unique C® solution in a neighborhood of 0. 


In the proof we shall first construct u so that gu+vu—h vanishes of 
infinite order when x’ =0. This requires the following lemma. 


Lemma C.2.2. Let A be a kxk complex matrix with eigenvalues 4,,...,A,- 
Then the eigenvalues of <Az,6/0z> as a linear transformation in the space Tm 
of homogeneous polynomials of degree m in (z,, ...,Z,) are the sums 


k 
eae STE 4,20 and Yaj=m. 


Proof. The matrix of (Az, 0/0z> operating on z, is the adjoint of A so it is 
obvious that it has the eigenvalues 1,,...,/,. By a linear change of the z 
variables we can put A in upper triangular form, thus A,,=0 when j>k and 
A,,=A,. Thus 


<Az, 0/02) 2*={a,d> 27+Rz", R= V Aq z,0/02;. 
j<k 


The monomials z* with |cj=m form a basis in z,,. If we order them 
lexicographically it is clear that R increases the lexicographical order so 
«Az, 6/0z> as an operator in z,, has upper triangular form and the diagonal 
elements <a, 2>. The proof is complete. 


In particular, the lemma shows that <«Az,0/éz> is bijective on z,, for 
every m+0 if 0 is not in the convex hull of the eigenvalues. 

To pass from formal to genuine solutions of (C.2.3) we shall study the 
orbits of v by means of the following lemma. 


Lemma C.2.3. If A is a linear transformation in a real k dimensional vector 
space V and all the eigenvalues are in the open right half plane, then there is a 
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Euclidean scalar product ( , ) in V such that for some c>0 
(Ax, x)2c(x,x), xeV. 


Proof. Let | | be a Euclidean norm in Vand set 
Ix|?=fle“tx|?dt, xeV. 
0 


The integral is convergent since e~“'x is exponentially decreasing, and it is 
clear that ||x|| is a Euclidean norm. Now 


ie) 0 
leASx|j2—= f jew A'x|? dt= |x|]? + f lev 4*xi? de. 


If (, ) is the scalar product associated with || ||, it follows when s—0 that 
2(Ax, x) =[x|? 2 2c |x|’, 
which proves the lemma. 


Proof of Theorem C.2.1. We shall first find a formal power series 


wo 


> us (x’, x"), 
0 


where u” is a homogeneous polynomial of degree wu in x’ with coefficients 
C® in x”, which satisfies (C.2.3) of infinite order when x’=0. The term of 
degree yp in the equation is 


(2(0, x”) +<Ax’, 0/0x’>) u*(x’, x V+R,(x', x") =0 


where R,, is determined by the data and u°,...,u*~'. Since g>0 it follows 
from Lemma C.2.2 that these linear equations for the coefficients of u” have 
a unique solution. By Theorem 1.2.6 we can find uyeC® with the Taylor 
expansion )\u"(x’, x”) with respect to x’ when x’=0. The equation (C.2.3) 
can be written 
g(u—uy) +v0(u—uy)=h—guy —vUuy 

where the right-hand side vanishes of infinite order when x’=0. Hence it 
remains only to prove Theorem C.2.1 when h vanishes of infinite order for 
x'=0, and we know that if a solution exists it must also vanish of infinite 
order then. 

To solve (C.2.3) exactly we must examine the orbits of the vector field v, 


(C.2.4) dx'/dt=Ax'+w'; 9 dx" /dt=w’". 
By Lemma C.2.3 we may assume that (Ax’, x’)2c(x’, x’). Hence 
(dx'/dt,x')2co\\x'|?,_ ||dx"/dt|| Seq ||x’| 


if 0<cy<c and |x’|+|x”| is small enough. Thus d||x’||/dt 2c, ||x’|| so there is 
an orbit x(t), t<0, with x(0)=y given sufficiently small, and 


Ix'isiyes Ix’ Ol+lx"OlSiy'l+ly"| if ts. 
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We denote the solution by x(t, y) and observe that (C.2.3) can be written 
(C.2.3) (gu) (x(t, y)) +du(x(t, y))/dt=h(x(E, y)). 
Set ; 

G(t, v=Lalx(s, y))ds, t<0. 


Then |G(t, yi<C\t|. If the function h in (C.2.3) vanishes of infinite order 
_when x’=0 and a C® solution u exists, we have seen that it must also 
vanish of infinite order when x’=0 so we must have 


(C.2.5) u(y) = { h(x(t, y)) e@@ dt. 


The integral converges since h(x(t), y)=O(|y'|Ne%*“) for any N, and it is clear 
that (C.2.5) gives a solution of (C.2.3)'. By differentiation of (C.2.4) with 
respect to the parameters y it follows successively that the derivatives of 
x(t, y) with respect to y are at most exponentially increasing in t. Hence 
ueC® which completes the proof. 


Remark. Although the perturbation w plays a small role in the preceding 
proof it is not always possible to remove it by a change of variables, for 
there might be resonances among the eigenvalues of A. 


C.3. Clean Intersections and Maps of Constant Rank 


First recall that two C® submanifolds Yand Z of a C® manifold X are said 
to intersect transversally at x,eEYOZ if T,(Y)+T,(Z)=T,(X) when x=Xp. 
An equivalent condition is of course that N.(Y)ON,(Z)= {0}. If Yis defined 


locally by f,=...=f,=0 where df,,...,df, are linearly independent at x, 
and Z is similarly defined by g,=...=g,=0, it follows that 
af,,...,4f,,dg,,...,dg, are all linearly independent at x . Thus the in- 
tersection YAZ defined by f,=...=f,=2,=...=g,=0 is a C® manifold in 
a neighborhood of x, and 

(C.3.1) codim (Yn Z)=codim Y + codim Z, 

(C.3.2) T(AYAZ)=T(Y0T{Z), xeYoZ. 


Now the intersection YZ may very well be a manifold satisfying 
(C.3.2) even if the intersection is not transversal. An example is given by any 
two linear subspaces Y and Z of a vector space X. In that case (C.3.1) is 
replaced by 


(C.3.3) codim Y + codim Z=codim(YnZ)+e 


where e=codim(Y+Z). This example is in fact quite general: 
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Proposition C.3.1. Let Y, Z and YOZ be C® submanifolds of a C® manifold 
X, and assume that (C.3.2) is valid. For any xy¢YOZ one can then choose 
local coordinates in a neighborhood such that Y and Z are defined there by 
linear equations in the local coordinates. Thus (C.3.3) is valid with an integer 
e220 vanishing precisely when the intersection is transversal. 


Proof. We choose local coordinates such that x,»=0, Y is defined by'x, =... 
=x,=0, and YoZ is defined by x,=...=x,,,=0 where v=codim Y and 
v+p=codim(YZ). The manifold Z is defined by N=codimZ equations 
F(x)=0 with linearly independent differentials. Since F,=0 when x,=... 
=X,,,=0, we can write (Theorem 1.1.9) 


V+ 


Fi(x)= Fx) x 


where F,¢C®. Thus 
vtph 
dF,(x)= ¥) Fy(x) dx, 


k=1 
in YZ. By hypothesis these equations together with dx,=...=dx,=0 
define the tangent plane of Yo Z, so the matrix 


(Fi,(x)) j 1,..,Nsk=vt1,..,v+p 


is of rank pu. Suppose that the determinant with j=1,...,u is not 0. In a 
neighborhood of 0 we can then by elimination find a C® matrix (a;,) such 
that 


Gi(x) = Fils) ~ Yo ay(2) Fj) = Gale), P= HHL... 
j= = 


In a suitably small neighborhood of 0 the equations G,x)=0 with 
i=pt+l,...,N define a manifold W of codimension e=N—yp containing Y 
and Z. The codimensions of Y, Z and YoZ in W are respectively v—e, N—e 
and v+yu—e=v—e+N—e, so (C.3.1) is fulfilled, that is, Y and Z intersect 
transversally as submanifolds of W. If we choose new local coordinates y 
=(y’,y’) so that W is defined by y’'=0, the statement is now reduced to the 
transversal case where we have already observed that it is true. 


Definition C.3.2. If the intersection YAZ of two C” submanifolds of a C® 
manifold X is a C® submanifold satisfying (C.3.2) then the intersection is 
said to be clean and the non-negative integer e in (C.3.3) is called its excess. 


When using clean intersections it is useful to choose coordinates accord- 
ing to Proposition C.3.1 which allows one to apply statements from linear 
algebra. 

Now consider two C® manifolds X and Y and a C® map f: Y>X. If 
y ge and f’(y,) is injective, then a neighborhood of y, is mapped by f on a 
submanifold of X of dimension dim Y. On the other hand, if f’(yo) is 
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surjective, we obtain a local fibration of Y over X with fibers f—+(x). This 
follows from the implicit function theorem. We shall now discuss the more 
general case where f’ has constant but arbitrary rank. 


Proposition C.3.3. Assume that Y and X are C® manifolds and that f: Y-+ X 
is a C® map such that f’ is of constant rank r in a neighborhood of yoeY. 
Then one can choose local coordinates X,,...,X,_ and Y,,...,Y¥, at f(Yo) and yo 
so that f(y)=(y1, ---)¥,0,.--, 0). Thus f defines locally a fibration of Y over a 
submanifold of X. 


Proof. We may assume that XcIR”, that f(y.)=0, and that for f 
=(f,,...,4,,) the differentials of f,,...,f, are linearly independent at y,. 
Coordinates can then be chosen in Y so that y)=0 and fi(y)=y,, j=1,...,7. 
Then 0f,/0y,=0 when j>r and k>r, for the rank of Of/¢y would otherwise 
exceed r. Hence f.,,,...,f,, are function of y,,...,y, only, which means that 
SIW=fO 4, +++. Vp 0,---, 0). The range of f is therefore a submanifold of X of 
dimension r. Now we change the coordinates in X so that it.is defined by 
X44, ---=X,,=0 and start the proof over again. Then f,,,=...=f,,=0 
which proves the statement. 


C.4. Folds and Involutions 


If Yand X are C® manifolds and f: Y>X is a C® map, then there is a 
well defined Hessian map going from Kerf’(y)<T,(Y) to Cokerf’(y) 
= Tyyy(X)/f'(y) T,(Y) for every yeY. Indeed, if ¢: R>Y is a C® map with 
o(0)=y, 6'(0)=neKer f'(y) then we have in terms of local coordinates 


FON) -FWM=F'W(O() —Y) +f’ W)G(S) —y), (9) —y>/2 + O(8*) 
=S*(f'(y) 9" (O)+<F"O) n, ndy/2 + (8°). 


The term f’(y) 6’ (0) drops out when we take the image in Coker f’(y) which 
shows that 


Ker f'(y)ant>< f(y), m>€Coker f'(y) 
is an invariantly defined quadratic form, called the Hessian of f 


In the following definition we introduce the class of maps which fail to 
be diffeomorphisms in the simplest possible way. 


Definition C.4.1. If Y and X are C® manifolds and f: Yo X is a C® map, 
then f is said to have a fold at y,€Y if dim Ker f(y.) =dim Coker f'(y))=1 
and the Hessian of f at y, is not equal to 0. 


Note that the hypothesis implies that Y and X have the same dimension. 
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Theorem C.4.2. If f: Y>X has a fold at y, then one can choose local 
coordinates y,,...,y, in Y which vanish at y, and local coordinates x,,...,x 
in X which vanish at f(y.) such that f is defined by 


n 


SO. oo VJ=Vn, st+9 Vn 49 ye). 


Proof. Choose local coordinates in X such that for the resulting representa- 
tion f=(f,,...,f,) we have df,(y,.)=0. Then df,,...,df,_, are linearly inde- 
pendent at yy so we can choose local coordinates in Y with y,=f;, j<n. 


Then 
fWY=01, mre Pere no) 


df,(0)=0 and 6?f,(0)/dy?+0 by hypothesis. The implicit function theorem 
shows that the equation 

Of,/OY,=0 
has a unique local solution y,=g(y), y =(),;,---,¥,_1), such that g(0)=0. 


Replacing the variable y, by y,—g(y’) we obtain new coordinates such that 
of,/6y,=0 when y, =0, that is, 


Fu) = fal, 0) + Yn FY) 


where FeC® and F(0)+0 (Taylor’s formula). If we take x,—f,(x’,0) as a 
new variable instead of x,, we have in the new coordinates 


SD=Op0 + Yn VE (y). 


Changing the sign of x, if necessary we may assume that F>0. We can then 
complete the proof by taking y, F(y)? as a new coordinate instead of y,. 


The theorem explains the term “fold” if we observe that when n=1 the 
graph {(f(y), y)}} cX x Y is a parabola folded around the x axis. 


Corollary C.4.3. If f: YX has a fold at y then there is in a neighborhood V 
of Yo a unique C® map i: V>V with foi=f which is not the identity. 


Proof. In the local coordinates of Theorem C.4.2 there is precisely one map 
with this property, and it is given by (y’, y,)>(y’, —),)- 


The map i is an involution, that is, ici is the identity. This follows from 
the explicit representation or else from the uniqueness: If foi=f then foici 
= foi=f so ioi=identity since ioi+i. We shall call i the involution defined 
by the fold. Its fixed point set is the hypersurface where f’ is not bijective, 
defined by y,=0 in the local coordinates. The importance of the involution 
is due to the following result: 


Theorem C.4.4. Let ueC® in a neighborhood of y,€Y, where f has a fold. 
Then one can find veC® in a neighborhood of f(y.)eX with f* v=u, if and 
only if i*u=u. 
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Proof. lf u=f*v then i*tu=(foi)*v=f*v=u, so the necessity is clear. To 
prove the sufficiency we work in the local coordinates of Theorem C.4.2. 
The function u is then even in y, so in the formal Taylor expansion 


Lah uly’, 0) vali! 
the terms with j odd must vanish. Choose v,EC® according to Theorem 
1.2.6 so that v, has the Taylor expansion 


Y a4 u(x’, 0) xi/(2j)!. 
Then u,(y)=u(y) —v,(y’, y2) vanishes of infinite order when y,=0. Hence 
vy (x)=u,(x', xf), x,>0; 0 (x)=0, x, SO, 
is a C® function, for all derivatives tend to 0 when x, 0 since 
D*u,(x',x2)x7%-0, x,>+0, 


for all a and N. Now v=v,+0, has the required property. 


Locally every involution leaving a hypersurface fixed can be defined by a 
folding map: 


Theorem C.4.5. Let Y be a C® manifold and i: YY a C® involution with 
fixed point set equal to a hypersurface containing y,¢Y. Then one can choose 
local coordinates y,,..., y, at Yo vanishing at y, such that 


iV, oo MI=V1> +09 Vn—1> —Y,): 


Proof. If ueC”(Y) then i*u+u and i*u—u are respectively even and odd 
under the involution. When wu runs through a system of local coordinates it 
follows that we can pick out from the functions obtained a local coordinate 
system y,,...,y, where every coordinate is either even or odd, that is, i*y,= 
+y,. The codimension of the fixed point set is the number of odd coor- 
dinates so it must be equal to 1. 


On the hypersurface of fixed points the involution defines a line bundle 
in the tangent bundle consisting of the eigenvectors of the Jacobian map 7’ 
with eigenvalue —1. Such vectors are of course transversal to the hyper- 
surface. We shall refer to the bundle as the reflection bundle of the in- 
volution. If i is defined by a folding map f then the bundle is the kernel of 
the differential of f 

We shall now examine when two involutions with the same fixed hyper- 
surface can be simultaneously put into a simple form. 


Theorem C.4.6. Assume that f and g are non-trivial C® involutions in Y with 
the same hypersurface S of fixed points and with linearly independent 
reflection bundles at y,éS. Then there exist local coordinates y,,..., y, vanish- 
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ing at y, such that 


(C.4.1) LY ID=Vs ~Vd — BV ID = LY Vd + In et 
where e, =(1,0,..., 0). 


We could of course have taken any other vector +0 with last com- 
ponent equal to 0 instead of e,. Note that the reflection bundles for these 
involutions are generated by e, and by e, —e,/2 where e,=(0,...,0, 1). 


Proof of Theorem C.4.6. We choose local coordinates y,,...,y, such that f 
has the desired form. Since g is an involution leaving the plane y,=0 
pointwise fixed, the Jacobian matrix g’ at (y’,0) must agree with the identity 
matrix except in the last column which is (a,(y),...,4,_,(’), —1). It maps 
(y',y,) to (4’,0)+n,(a,(y), ..-,4,_,(¥), —1) which shows that the reflection 
bundle is generated by (a,(y), ...,4,_1(¥), —2). By hypothesis some a, is 
therefore +0. We want to introduce new coordinates preserving the simple 
form of f but making g come closer to the desired form 


o> WW =O1 Tn Yao +009 Vn 4 yy. 


So far we only know that with the notation a=(a,,...,a,_,) and some 


AEC? 
g(y=f)+y, (ay), y, AW) +02. 


Here and below the notation O, means that the first n—1 components are 
O(y4) and the last is O(y**?). This classification of errors will turn out to be 
convenient when the variables are changed so that the plane y,=0 is re- 
spected. Now we introduce new coordinates 


u(y) = (4, (9), «5 Mp1 Vs Yn Ua Y))- 
This preserves the form of f, and we have 


Soo UY) =(U(Y) +p UV), U2 (V), «5 U1) —In UY) 
ucg(y)=(u,(V +y,a(y)), ---5U,_i1 0’ +y, ay), 
(—y,+ yz Aly’) u(y +y, ay) +02. 


These compositions are equal apart from an O, error if 


{u,@>=u,; <uj,a>=0, j=2,....n-1;  <u,,a>=Au,. 


These differential equations can be solved locally with initial data u, =0, u, 

=1 and u, equal to a system of local coordinates on a hypersurface 

transversal to a in the plane y,=0. (In particular this means that we put the 

foliation of S, defined by the intersection of the tangent space of S and the 

sum of the reflection bundles, in the form where the leaves are along the y, 

axis.) Taking u,,...,U,_1,¥,U, aS new local coordinates we have achieved 
that g=g,+0,. ~ 
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Assume more generally that we have succeeded to find local coordinates 
such that f has the simple form (C.4.1) and 


(C.4.2) 8) =8o(y) +O, 

for some k=2. We write 

(C.4.3) 8(Y)=8o(V) + Yah), Yn HV) + Ons 1 

where h has n—1 components and H has one. Two cases must be distin- 
guished. 


a) k is even. Since g is an involution we obtain after a simple calculation 
V=So8(Y)=So°8o(W+2V AY), —Yn HCY’) +O, 41. 

It follows that h=0 and that H=0. 

b) k is odd. Set 

u(y)=ytyn (0), ¥n VV). 
Then 
u-*(y)=y—ya* (00), VV VY +O 24-25 

and 2k -22k+1 since k23. A straightforward computation gives 


Zour" (y)=go(y) YOY) +, VY e1, —n VV) 
+I), Yn HV) + Ops 1 
wogou (Y=So(Y— Ye YI FY Ver, —YV VY + nA), Yn HV) 
+I (OY + Ines “VOY tne) I) + Ox 1 
=So(y) + yn(h(y’) + dv(y' Voy, Ver, 
YAH (y')—OV(y/Oy)+ Ons 
Here we have written e, for the first unit vector in R"~'. This is equal to g, 
+0,,, if 
dv/dy,=Ve,—h, oaV/ey, =H, 


and these equations are easily solved. . 
Iteration of the preceding steps leads to a formal power series solution 
with respect to y, of the problem of finding u(y) such that 


1 1 


ucefou~’=f and ucgou~*=gpo. 


By Theorem 1.2.6 we can choose a C® function u with this Taylor expan- 
sion. Introducing the new coordinates u we have then reduced the proof to 
the case where f is exactly of the form (C.4.1) and g—g, vanishes of infinite 
order when y, =0. 


As in the proof of Theorem C.2.1 we switch now to another argument in 
order to eliminate this small error. First note that if w is one of the desired 
coordinate functions other than y, then it must be even (odd) both with 
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respect to f and with respect to g, thus 

(C.4.4) wof=wog, 

Conversely, if we have a function w satisfying this equation then 
wtwof=wtwog 


is even (odd) both with respect to f and g. Thus we look for solutions of 
(C.4.4) or rather the equivalent equation 
(C.4.4)’ wod=w, O=fog. 
From the first part of the proof we have that 
HAY =Yt+Yne, + ply) 


where p vanishes of infinite order when y,=0. Set w,= y; for some j+1 and 
set W=W, —W,. Then (C.4.4)' is equivalent to 


w,—w,°d=h; h=wy—woed. 
Here A vanishes of infinite order when y,=0. Choose h,eCf equal to A ina 
neighborhood of 0. It suffices to solve the equation 
w,—w,cd=h, 
which. is formally done by the infinite series 


(C.4.5) Wy =hy+hyod+hyod0d+.... 


We shall prove below that (C.4.5) converges and defines a C” function 
vanishing of infinite order when y,=0, for every h,eC? which vanishes of 
infinite order. Accepting this for a moment we replace the coordinate 
functions y,,...,y, by the modifications just constructed, which are even 
(odd) with respect to both f and g. With the new coordinates f still has the 
form (C.4.1), since y, is also even with respect to f/ and 


s(y)=800) +e, RO) 
where R vanishes of infinite order when y,=0. Now put wy»=y,. We have 
Wo Woo d= —y, +hp(y) 


where h, vanishes of infinite order when y,=—0. Hence we can solve the 
equation 
w,—w,°d=h, 

as before in a neighborhood of 0. Then w=w,—w, satisfies the equation w 
—wed=-—y,, and w—y, vanishes of infinite order when y,=0. The equa- 
tion w—wefog=-y, can be written weg—wef=y,. Hence w= 
(w+wof)/2 is even with respect to f while Wog=w+y,. When W is taken as 
new y, coordinate both f and g have the desired form. 
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It remains to prove the convergence of the series (C.4.5). Put 5* 
=60,..06 with k “factors”. Then é*(y)=y+ky, e, +&(y), 


41 = 6 +E, 2, +p(ytky,e;+&) 
where E, is the n'® component of ¢,. Thus &)=0 and for some constant C 
lege 1 —& — Ey esl S Clly,| +1E,)*. 
If |E,Sly,| when j<k it follows by taking the n® component that 
\Ej41-E,j|S16Cyf, jSk, 
[Ex 4: 1=lEx41 El S16Ckyp Sy; Sly,| if K<(16CyZ)~*, ly, <1. 
Since this implies 
lé;41—€lSy2+16Cyt, j<(16Cy?)-', 
we obtain 
\5*(y)—y—ky, e|=lelSkyn(l+1l6Cyg)<2 if k<(16Cyz)~?, 


provided that 16C>1 and |y,|<1. Thus the iterates move almost as if there 


were no error term. 


Let 
w={y3 |lyl<n,lyl<1} 


where 7 is small, and let |y|<M in supph,. If yew we have |5*(y)|>M if 
Iky,|>M+3 and k<(16Cy?)~'; such a k exists if 


1+(M+3)/ly,|<(L6C yn)~*. 
This is true for all yew if we choose 4 so small that 
. (M +3)n+n?<(16C)-. 


Breaking off the series (C.4.5) at the first such integer we still have a C® 

solution of (C.4.4)' in w when y,+0. The number of terms is O(1/ly,]|). It 

remains to give bounds showing that all derivatives tend to 0 when y, -0. 
Let N be any positive integer and set 


w= 2, Fl 


Then we have with a constant C, depending on N 
lex tly Slély + lLEgly + Cylyl? (1 + lel)” 
IE, ilvS |E, ln + Cyly, |? (1 + legly)”. 


We want to show that |é,|,< yz, JSk, if y, is small and k is the number of 
terms which we have estimated by O(1/ly,|). This would imply that |w,ly 
=O(y,) and prove that w,eC®(q@) with all derivatives 0 when y,=0. As- 
sume that we know that |e|y<y7 for all j<J<k. If y, is small it follows 


that 
IE js lw SlEjlyt+2Cyly,l*, isJ, 
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and adding these inequalities we obtain since E,=0 
( BylyS2Cykly I <Cyys, fSI+1. 
But this implies that 
lej4 sly Slejly + Cy Ve if jSJ, 
and adding these inequalities we obtain if y, is small 
lelwSCykyp<yn if jSJ +1. 


By induction it follows that |éj|y¥< y? if jSk and y, is small, and this finishes 
the proof. Note that the estimates have followed the simple principle that 
the n‘* components of the iterations are estimated first since they hardly 
change at all. The other components are easy to estimate afterwards. This 
ends the proof of Theorem C.4.6. 


Corollary C.4.7. Let the hypotheses of Theorem C.4.6 be fulfilled and let v be 
a tangent vector of S at y, which is not in the span of the reflection bundles. 
Then there is a hypersurface Y,<Y through y, which is transversal to v and 
is invariant under f and g so that they induce involutions in Y, with all the 
properties assumed in Y. 


Proof. With the local coordinates in Theorem C.4.6 the hypothesis on v is 
that v=(v,,...,0,_ 1,0) with v,+0 for some j+1. The surface 


n—1 
ly yy Vj 0} 


is then transversal to v and clearly the involutions f and g have basically 
the same form there. 


It is now easy to prove a homogeneous version of Theorem C.4.6. 


Theorem C.4.8. Assume that f and g are non-trivial homogeneous C® in- 
volutions of the conic C® manifold Y having the same fixed point hypersurface 
Y,. Assume that the two reflection bundles and the radial vector are linearly 
independent at y EY). Then n=dim Y 23 and there exist local coordinates 
Yy>---»¥, in a conic neighborhood of yg such that y,,...,y,., are homo- 
geneous of degree 0, y, is homogeneous of degree 1, and 


S Vie 0005 Va) =( Vas Var es Vad 
EV 45 ++ Va) =( Vas Yao 029 Vn a t V1 Vad 


Proof. The surface Y, is conic so the radial vector p is a tangent at yo. By 
hypothesis it is not in the linear span of the reflection bundles so using 
Corollary C.4.7 and Theorem C.4.6 we can choose a hypersurface Y, 
through y, transversal to p(y,) and local coordinates y,,...,y,_, in Y, 
which vanish at y,, such that (y,,...,y,_1) is mapped to (—y4, V2, ---s ¥a_4) 
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by f and to (—y,,¥2,---;¥n_1 +1) by g. Now extend these coordinates to 
polar coordinates with respect to Y,, that is, define y,—1 in Y, and extend 
all y, to homogeneous functions of the stated degree of homogeneity. By the 
homogeneity of f and g we then obtain the stated conclusion. 


C5, Geodesic Normal Coordinates 


Let X be a C® manifold and g(x, €) a real C® quadratic form in T*(X). If g 
is non-degenerate then the dual quadratic form in T(X) is a (pseudo-) 
Riemannian metric and it is well known then how one can introduce 
geodesic normal coordinates with respect to a submanifold. We shall now 
give a symplectic derivation of such results using weaker hypotheses on g. 

Let YcX be a C® submanifold of X such that g restricted to the 
conormal bundle N*(Y) is non-degenerate. This implies that 


V={(y, Og(y,n)/On); (y, n)eN*(Y)} 

is a subbundle of T,(X) of total dimension equal to dim X. The annihilator 
in T;*(X) of Vis the g orthogonal bundle of N*(Y). Thus it is disjoint with 
N*(Y)X0 so V is dual to N*(Y), hence isomorphic to the normal bundle 
Ty(X)/T(Y). We shall deduce a natural diffeomorphism between a neigh- 
borhood of 0 in V and a neighborhood of Y in X by means of this 
identification and the flowout from the Lagrangian N*(Y) along the Hamil- 
ton field of g(x, €)/2, defined in local coordinates by 


H =(02/0€0/0x —0g/0x0/08)/2. 
We shall write @, for exp tH. 


Theorem C.5.1. If U is a sufficiently small neighborhood of the zero section in 
N*(Y) then the Lagrangian ¢, U<T*(X) is a section S over a neighborhood 
Uy, of Y in X. If m5:S— Uy is the restriction of the projection to S, then S 
= {(x,dF(x)),xeUy} where F=(,'n5')*g/2. One can choose local coor- 
dinates (x',x”) in X near any point in Y so that Y is defined by x'=0 and 
with a fixed non-degenerate quadratic form g,(¢') 


(C.5.1) 0g(x, €)/0E=0g,(E)/0E when &’=0 
and x'=t0g,(é')/eg’ for some teR. 
In particular, g(0, x”, €)=g¢9(E')+ g(x", €”) for some quadratic form g,. 


Corollary C.5.2. When Y is a point we obtain coordinates with the origin at Y 
and 


Og(x, C/dE=0g(0,€)/0E when x=tég(0,5/e0é for some teR. 
If we write g(x, €)=<g(x)é, é> with g(x) symmetric, this means that 
(C.5.2) x= g(x) 2(0)-' x. 
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Corollary C.5.3. When Y is a non-characteristic hypersurface, then €' is one 
dimensional and 
a(x, y= +5 +r(x, €") 


if we write €, instead of &' and €" =(€,,...,€,). 


In the positive definite case Corollary C.5.2 gives standard normal coor- 
dinates when we take g(0)=identity. Note that Corollary C.5.3 does not 
assume that g is non-degenerate. 


Proof of Theorem C.5.1. By the homogeneity of g the Hamilton equations 
(C.5.3) 2dx/dt=dg/0é, 2dé/dt= —dg/Ox 
for the orbits of H imply for fixed s+0 that 

2dx/d(st)=g(x, ¢/s),  d(€/s)/d(s t)= —g.(x, ¢/s). 


Hence 
(C.5.4) (x, )=(x(0, 6(.) implies  ¢,,(x, C/s)=(x(0), ¢(0)/s). 
Choose local coordinates (x’,x”) so that Y is defined near 0 by x’=0. Since 


(C.5.4) gives 
14 (x,66)=2o,(x, 6) =x +e/2 02/6 + O(c”) 


it follows that 
(C.5.5) np (x, E)=x+O(g/2)/0E+O(E), €>0. 


By hypothesis the map +> 0g(0, €,0)/0€’ is surjective so the implicit func- 
tion theorem shows that 2 @, is a local diffeomorphism at the zero section 
of N*(Y)={(0, x”, €’,0)}. Choose a neighborhood U with convex fibers so 
small that 2@, is a diffeomorphism on U. Since ¢, is canonical we know 
that S=@, U is Lagrangian. We shall compute the differential €dx restrict- 
ed to S in terms of the coordinates in U. To do so we compute 


0 0 
Bp Pi CaN = OF a Os AX). 0 = Pr (AE/dtdx + ¢ d(dx/dt)). 


The parenthesis is equal to 


(-dg/dxdx +E d(g/0£))/2=( a g/0¢ 0x —Ag/Ox)dx +E 0° g/A6? dé)/2 
=(dg/0xdx + dg/déde)/2=dg/2 


by Euler’s identity for homogeneous functions. Now g is constant. on the 
orbits of H so we obtain @*(€dx)=dg/2, for P%(Edx)=0 since Edx=0 on 
N*(Y). Thus S is the graph of dF as stated. 

We can choose our local coordinates so that the vector bundle V is 
tangent to the fibers x” =constant when x’=0. In fact, since V is transversal 
to T(Y) it is defined by an equation of the form dx” =f(x’)dx' where f is a 


$02 Appendix C. Some Tools from Differential Geometry 


smooth matrix. If we replace the coordinates x” by x’’—f(x”)x’ the desired 
property is obtained. It means that 0? g/d’ dé” =0 on N*(Y). 

With our local coordinates the quadratic form g restricted to N*(Y) is a 
non-degenerate form in &’ depending on the parameter x”. By completion of 
squares we obtain a local change of variables y’=T(x")x’, y’=x” which 
makes it independent of x”. Thus there is a non-degenerate quadratic form 
8 {n’) such that 2(0, y’,4)—g(7’) is a form in the y” variables. Now consider 
the diffeomorphism from V to X 


(go(n')/2, yon 6,0, y", 7,0); (ys eU. 


By (C.5.5) the Jacobian matrix is the identity when n’=0. We can change the 
coordinates in X so that the map becomes the identity, without affecting the 
conditions previously imposed on the coordinates at Y. Then the function F 


is defined by 
F(go(1')/2, ¥') = 8 o(n')/2. 


If Gp is the dual quadratic form of g, then y'= =Boltt)/2 is equivalent to 7’ 
=Gi(y)2, thus F’(y)=(Go(y’)/2, 0) or 


(C.5.6) F'(go(n')/2, ¥') =(1', 0) 
By (C.5.4) we have with our present coordinates 
1 ,(0, y", n',0)=2 G10, y’,t1',0)=(¢ g0(9')/2, y’). 
Thus the solution of the Hamilton equations (C.5.3) with initial values 
(0,y",7',0) is x=(tgo(y’)/2,y"), and when t=1 we have €=F'(x)=(n’,0). 
Since the Hamilton equation for dx/dt gives for t=1 
(80(1')/2, 0) =3 Og (8o(n')/2, ys 8)/0€, 


this completes the proof of Theorem C.5.1. 


C.6. The Morse Lemma with Parameters 


The advantage of the Morse lemma is that it shows that a C®” function is 
equivalent to a polynomial near a non-degenerate critical point. We need 
the result with parameters: 


Lemma C.6.1. Let f(x,y) (xelR", yeIR®) be a real valued C® function in a 
neighborhood of (0,0). Assume that f/(0,0)=0 and that A=f{).(0,0) is non- 
singular. Then the equation f{(x,y)=0 determines in a neighborhood of 0 a 
C@ function x(y) with x(0) = 0, and we have in a neighborhood of (0,0) 

F(x, Y)=F(x(y), y) + (Az, 29/2 


where z=x —x(y}+ O(|x —x(y)] (x|+ yl) is a C® function of (x, y) at (0,0). 
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Proof. By the implicit function theorem the equation f{(x,y)=0 has a 
unique solution x(y) near 0. Introducing x —x(y) as a new variable instead 
of x we may assume that f/(0,y)=0 for small y. By Taylor’s formula we 


have 
S(x,y) -F0, yy=¥ bx, y) x; ,/2, 
1 
bulrsy=2[(1-10,0,f ex, nde 


Thus B=(b,,) is a C® function of (x, y) with values in the finite dimensional 
vector space of symmetric nxn matrices, and B(0,0)=A. Write z= R(x, y)x 
where R is an xn matrix to be determined so that R(O,0)=I and 


(C.6.1) R* AR=B. 


The equation is valid when x=y=0 and R=I, B=A. The differential of the 
map Rr>R* AR is then 


Rre+R*A+AR. 


It is surjective for if C is a symmetric matrix we have R*A+AR=C when 
R=A7'C/2. Hence the inverse function theorem shows that (C.6.1) is ful- 
filled when R=F(B) where F is a C® function defined for symmetric nxn 
matrices close to A with arbitrary nxn matrix values and F(A)=I. The 
lemma follows with z= F(B(x, y)) x. 


Lemma C.6.1 could have been used to derive Theorem 7.7.6 from Lem- 
ma 7.7.3 although it is hard to calculate the operators L, ,, there in that 
way. Another application occurs in Section 22.3: 


Lemma C.6.2. Let f be a non-negative C® function with f(0)=0 defined in a 
neighborhood of OeIR*, and let n be the rank of f’(0). Then one can find C® 
functions c,,...,c, and g vanishing at 0 such that dc,,...,dc, are linearly 
independent at 0, g=0, g’(0)=0 and 


(C62) fadedtg 


Proof. Since f(0)\=0<f we have f’(0)=0. By a linear change of coordinates 
we can make sure that det f.(0)+0 if x=(x,,...,x,) are the first n coor- 
dinates. Let y = (y1,...,¥e¢-n) be the others. If we apply Lemma C.6.1 and 
introduce z, y as new variables, we obtain 


f(x)=8(y)+¢Az,z> 


where A is positive definite and g20. The Hessian of f in these coordinates 
is the direct sum of A and of g”(0) so g”(0)=0 by the definition of n. If we 
write <Az,z> as a sum of squares and return to the original variables, the 
lemma is proved. 
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Notes 


For Sobolev spaces H,,. the results discussed in Section B.1 are so well 
known that we shall not try to trace their origins. In order to have 
appropriate spaces for an intrinsic definition of Lagrangian distributions we 
have also included a very limited discussion of Besov spaces. For a more 
systematic presentation we refer to Peetre [4]. The results in Section B.2 for 
a manifold with boundary have been taken from Peetre [1] via the prede- 
cessor of this book. 

We shall also refrain from historical comments on the majority of the 
scattered topics discussed in Appendix C. The only recent result is Theorem 
C.4.6. It has been isolated from the proof of Theorem 21.4.12 in Melrose [2] 
as the non-symplectic part of his argument. The rest of it was given in 
Section 21.4. 
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